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SEQUENCE LISTING 



v!iC> Chen, Ruoping 
Dang, Huong T. 
Liaw, Chen W. 
Lin, I-Lin 

<120> Human Orphan G Protein Coupled Receptors 

-■: 1 3 0 > AREN0050 
-■ 1 4 0 > 09/47 1, 044 

• : } : • 1999-10-19 

■*15 0> 60/109,213 
---151> 1998-11-20 
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1 3 1 > 1999-02-16 
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- Ill ■ 1 999-02-26 

• . I 5 0 > 60,123,946 
•:'15I> 1999-03-12 

■■ 1 7 0 > 60/123,949 
<151> 1999-03-12 

1 5 0 > 60/136,436 
<151> 1999-05-28 

150> 60/ 136, 437 
15 1> 1 999-05-28 

- 1 5 0 > 60/136,439 
■-151> 1999-05-23 

■. 1 50> 60/ 136, 567 
■.:51> 1 999-05-28 

■ ■ I 3 0 > 60/ 137, 127 
v!31> 1999-05-28 

15 0> 60/ 137,1 31 
■■' 1 5 1 > 1 999-05-28 

«-:-30> 6 0 / 1 4 1 , 4 4 8 




. > . 9 9 r 1 - rj - ^ 9 

9> 60/156, 653 

,1> 19 99-09-2 9 

:> 60/156, 633 

. 1> 19 99-0 9-2 9 



■- 1 3 I > 19^9-09-29 

■vl00> 60/ 156, 634 

■:151> 1 999-09-29 

■:150> 60 / 1 5 7,29 0 

■.151> 1999-10-01 

150> 60/15", 2 94 

■ :b;-> 1999-10-01 

■150> 60/157, 281 

■ ■ ! 5 1 > 19^9-10-01 

■100> 60/157,293 

■■ L 3 I > 1999-10-01 

■-;50> 60/15", 232 

<'151> 1999-10-01 

<160> 7 4 

■■' 1 " 0 > Patent In Ver. 2.1 

<2 1Q> 1 

<" 2 1 1 > 12 60 

■■212> ONA 

•■2 13> Hcrr.o sapiens 



iti-^tzttct cggcagtgtt gactgcgttc cataccggga catccaacac aacatttgtc 60 

gtgtatqaaa acacctacat gaatattaca ctccctccac cattccagca tcctgacctc 120 

j'jt:;at"gc ttagatatag ttttgaaacc atggctccca ctggtttgag ttccttgacc 180 

itigaatagta cagctgtgcc cacaacacca gcagcattta agagcctaaa cttgcctctt 240 

:agatcaccc tttctgctat aatgatattc attctgtttg tgtcttttct tgggaacttg 300 

gt r -gr.t tgoc tcatggttta ccaaaaagct gccatgaggt ctgcaattaa catcctcctt 360 

gccagcctag cttttgcaga catgttgctt gcagtgctga acatgccctt tgccctggta 420 

Krtattctta ctacccgatg gatttttggg aaattcttct gtagggtatc tgctatgttt 480 

f.ctggttat ttgtgataga aggagtagcc atcctgctca tcattagcat agataggttc 540 
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.1 ^ 


- a t d ) 


c ; agaggca 


g g a t a a g c t a 


aacccatata 


g a g c t a a g g t 


tctgat tgea 


60 0 


7t t 




ttggg 


caacttcctt 


t t gtgtagct 




ccgtaggaaa 


ccccgacctg 


660 






acct t 


cccgagct cc 


ccagt gtgtg 


tttgggtaca 


caaccaat cc 


aggct accag 


720 


j o t 


t a 


tgtga 


ttttgatttc 


tctcatttct 


ttcttcatac 


cct tcctggt 


aat act gtac 


780 


t r 3 


■j- 


tatgg 


gcat actcaa 


caccct tegg 


cacaa t gcct 


tgaggatcca 


tagct accct 


840 


"I ~i 




tatat 


gcct cagcca 


ggccagcaaa 


ctgggt ct ca 


t gagtctgea 


gagacc tt tc 


900 


1 j 




::a^ca 


1 1 gacatggg 


ct ttaaaaca 


cgtgcct tea 


ccactatttt 


gattctcttt 


960 


v.'t 


71- 


:; 1 1 r- a 


t tgt ct get g 


ggccccat t c 


accact taca 


gccttgtggc 


aacat t cagt 


1020 




:a 


j 1 1 1 1 


a c t a t c a g c a 


caactt ttt t 


gagatt agca 


cct ggctact 


gtggctctgc 


1080 


y : a c 


i- 


c a a g t 


o tgcat tgaa 


tccgctgatc 


t actactgga 


ggat taagaa 


att ccatgat 


1140 


J - - 




cctgg 


acatgatgee 


taagtccttc 


aagtttt tgc 


cgcagctccc 


tggtcacaca 


1200 


.5 a g 


cqacgga 


tacgtcctag 


t gctgtctat 


gtgtgtgggg 


aacateggae 


ggtggtgtga 


1260 



: : > a 19 

*-212> FRT 

••"21 3 > H o mo sapiens 



Met Val ?he Ser Ala Val Leu Thr Ala Phe His Thr Gly Thr Ser Asn 

5 10 15 

Thr Thr Phe Val Val Tyr Glu Asn Thr Tyr Met Asn lie Thr Leu Pro 

20 25 30 

•'" r o Pro Phe Gin His Pro Asp Leu Ser Pro Leu Leu Arg Tyr Ser Phe 
35 40 45 

Glu Thr Met Ala Pro Thr Gly Leu Ser Ser Leu Thr Val Asn Ser Thr 

3 0 5 5 60 

-.la Val Pro Thr Thr Pro Ala Ala Phe Lys Ser Leu Asn Leu Pro Leu 

hb 70 75 80 

Gin lie Thr Leu Ser Ala lie Met lie Phe lie Leu Phe Val Ser Phe 
85 90 95 

[.':■: Gly Asn Leu Val Val Cys Leu Met Val Tyr Gin Lys Ala Ala Met 
100 105 110 

Ser Ala lie Asn lie Leu Leu Ala Ser Leu Ala Phe Ala Asp Met 
115 120 125 

:.ou Leu Ala Val Leu Asn Met Pro Phe Ala Leu Val Thr lie Leu Thr 

130 135 140 

7hr Arg Trp lie Phe Gly Lys Phe Phe Cys Arg Val Ser Ala Met Phe 
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- • ' - HO 15 5 160 

Phe Trp Leu Fhe Val lie Glu Gly Val Ala lie Leu Leu lie lie Ser 
165 170 175 

:H Aop Arg Phe Leu He He Val Gin Arg Gin Asp Lys Leu Asn Pro 
180 18 5 190 

:vr Arg Ala Lys Val Leu He Ala Val Ser Trp Ala Thr Ser Phe Cys 

195 200 205 

Val Ala Phe Pre Leu Ala Val Gly Asn Pro Asp Leu Gin He Pro Ser 

2 10 215 2 20 

Arg Ala Fro Gin Cys Val Phe Gly Tyr Thr Thr Asn Pro Gly Tyr Gin 

225 230 235 240 

Ala Tyr Val lie Leu He Ser Leu He Ser Phe Phe He Pro Phe Leu 
245 250 255 

Val He Leu Tyr Ser Phe Met Gly He Leu Asn Thr Leu Arg His Asn 

260 265 270 

Ali Leu Arg lie His Ser Tyr Pro Glu Gly He Cys Leu Ser Gin Ala 
275 280 285 

Ser Lys Leu Gly Leu Met Ser Leu Gin Arg Pro Phe Gin Met Ser He 
290 295 300 

Asp Met Gly Phe Lys Thr Arg Ala Phe Thr Thr He Leu He Leu Phe 

H5 310 315 320 

Ala Val Phe He Val Cys Trp Ala Pro Phe Thr Thr Tyr Ser Leu Val 
325 330 335 

Ala Thr Phe Ser Lys His Phe Tyr Tyr Gin His Asn Phe Phe Glu He 
340 345 350 

G-r Thr Trp Leu Leu Trp Leu Cys Tyr Leu Lys Ser Ala Leu Asn Pro 

"355 360 365 

L-.-u He Tyr Tyr Trp Arg He Lys Lys Phe His Asp Ala Cys Leu Asp 

3^0 37 5 380 

Vet Met Pro Lys Ser Phe Lys Phe Leu Pro Gin Leu Pro Gly His Thr 
"i«5 390 395 400 

Lys Arg Arg lie Arg Pro Ser Ala Val Tyr Val Cys Gly Glu His Arg 
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425 




410 




4 15 




Thr 7a 1 7a 

^ _ ■., - j 














■2 11 > 1119 
■ 2 12 .> DMA 














•■' 2 1 3 > Mono 


sapiens 












■:4 0C> 3 














itqttagcca 


acagctcctc 


aaccaacagt 


t ctgt tctcc 


cgt gt cctga 


ctaccgacct 


60 


acc:a:cgcc 


tgcact tggt 


ggt ct acagc 


t tggtgctgg 


ctgccgggct 


ccccctcaac 


120 


jcgctagccc 


t er.gggt ctt 


cct gcgcgcg 


ct gcgcgt gc 


act eggt ggt 


gagcgtgtac 


180 


■i t g t g t a a c c 


tggcggccag 


cgacct get c 


t tcaccctct 


cgctgcccgt 


tcgtctctcc 


240 


f _ a c t a c g c a c 


tgcaccactg 


gcccttcccc 


gacct cctgt 


gccagacgac 


gggegecat c 


300 


v. t; :::aqarga 


acat gt aegg 


cagct gcatc 


tt ectgatge 


t catcaacgt 


ggaccgctac 


360 


.gc cqccat eg 


t gcacccgct 


gcgactgcgc 


cacct gegge 


ggccccgcgt 


ggegegget g 


420 


ctctgcctgg 


gcgtgtgggc 


get catcctg 


gtgtt tgccg 


t gcccgccgc 


ccgcgtgcac 


480 


•- i : ' s : :: 


gt t gecget a 


ccgggacctc 


gaggtgegee 


t at gcttcga 


gagct tcagc 


540 


gacgagctgt 


gq a a a ggcag 


get gctgccc 


ctegt getge 


t ggecgagge 


get gggctt c 


600 


--getgee cc 


t ggcggcggt 


ggt ct act eg 


t egggecgag 


t ct tctggac 


get ggegege 


660 


":'cqacgc;:a 


cgcagagcca 


geggeggegg 


aagaccgtgc 


gcctcct get 


ggctaacct c 


720 


Itoatcttcc 


tqctgtgctt 


cgtgccct ac 


aacagcacgc 


t ggcggt eta 


egggctget g 


780 


e g qaqcaagc 


tggtggcggc 


cagegt gect 


gcccgcgatc 


gcgtgcgcgg 


ggt gctgatg 


840 


jt gatggtgc 


tget ggcegg 


cgccaact gc 


gtgctggacc 


cgctggt gta 


ctactt tagc 


900 


gecgaggget 


tccgcaacac 


cctgcgcggc 


ctgggcactc 


cgcaccgggc 


caggacctcg 


960 


gccaccaacg 


ggacgeggge 


ggcgctcgcg 


caatccgaaa 


ggtccgccgt 


caccaccgac 


1020 


g c c a c caggc 


eggat geege 


cagt cagggg 


ct gctccgac 


cctccgactc 


ccact ct ct g 


1080 


tct test tea 


cacagt gt cc 


ccaggatt cc 


geect ctga 






1119 



-:2 10> 4 

2 1 1 > 37 2 

<2\2> ?RT 

■ ' 2: 2 3 > Homo sapiens 



Met Leu Ala Asn Ser Ser Ser Thr Asn Ser Ser Val Leu Pro Cys Pro 

15 10 15 

Asp Tyr Arg Pro Thr His Arg Leu His Leu Val Val Tyr Ser Leu Val 

20 25 30 

Leu Ala Ala Gly Leu Pro Leu Asn Ala Leu Ala Leu Trp Val Phe Leu 

35 40 45 
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Y 



A: : A La l.ou Arg Val His Ser Val Val Ser Vai Tyr Met Cys Asn Leu 

10 5 5 60 

Ala Ala Ser Asp Leu Leu Phe Thr Leu Ser Leu Pro Val Arg Leu Ser 

0 5 7 0 7 5 8 0 

Tyr Tyr Ala Leu His His Trp Pro Phe Pro Asp Leu Leu Cys Gin Thr 
85 90 95 

Thr Giy Aid lie Phe Gin Met Asn Met Tyr Giy Ser Cys lie Phe Leu 

100 105 110 

Met Leu lie Asn Val Asp Arg Tyr Ala Ala lie Val His Pro Leu Arg 
115 120 125" 

ieu Arg Wis Leu Arg Arg Pro Arg Val Ala Arg Leu Leu Cys Leu Gly 
130 135 140 

Val Trp Ala Leu lie Leu Val Phe Ala Val Pro Ala Ala Arg Val His 
145 150 155 160 

Arg Pro Ser Arg Cys Arg Tyr Arg Asp Leu Giu Val Arg Leu Cys Phe 
165 170 175 

; . .; Ser Phe Ser Asp Giu Leu Trp Lys Giy Arg Leu Leu Pro Leu Val 
180 18 5 190 

Leu Leu Ala Giu Ala Leu Gly Phe Leu Leu Pro Leu Ala Ala Val Val 
195 200 205 

Tyr Ser Ser Giy Arg Val Phe Trp Thr Leu Ala Arg Pro Asp Ala Thr 

210 215 220 

Gin Ser Gin Arg Arg Arg Lys Thr Val Arg Leu Leu Leu Ala Asn Leu 
225 230 235 240 

Val lie Phe Leu Leu Cys Phe Val Pro Tyr Asn Ser Thr Leu Ala Val 
245 250 255 

Tyr Giy Leu Leu Arg Ser Lys Leu Val Ala Ala Ser Val Pro Ala Arg 
260 265 270 

A op Arg Vai Arg Gly Val Leu Met Val Met Val Leu Leu Ala Gly Ala 
?-7 5 280 285 

Asn Cys Val Leu Asp Pre Leu Val Tyr Tyr Phe Ser Ala Giu Gly Phe 

2 90 295 300 
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A:-.; Asr; Tr.r Leu Arg Gly Leu Giy Thr Pro His Arg Ala Arg Thr Ser 

>05 310 315 320 

Ala Thr Asn Gly Thr Arg Ala Ala Leu Ala Gin Ser Glu Arg Ser Ala 

325 330 335 

: , . Thr Thr Asp Ala Thr Arg Pro Asp Ala Ala Ser Gin Gly Leu Leu 

34 0 34 5 350 

A:g Pro her Asp Ser His Ser Leu Ser Ser Phe Thr Gin Cys Pro Gin 

'35 5 360 365 



■■■2 10> 5 

■-: 211 ■> 110 7 

■ 2 1 2 > LNA 

2 I i > Homo sapiens 



arggccaact ccacagggct gaacgcctca gaagtcgcag gctcgttggg gttgatcctg 60 

g.oagcigtcg vggaggt ggg ggcactgctg ggcaacggcg cgctgctggt cgtggtgctg 120 

gactgcgcga cgcgctctac ctggcgcacc tgtgcgtcgt ggacctgctg 180 

gcggccgcct cca:catgcc gctgggcctg ctggccgcac cgccgcccgg gctgggccgc 240 

::. gcgcctgg gooocgogoc atgccgcgcc gctcgcttcc tctccgccgc tctgctgccg 300 

gcctgcacgc tcggggtggc cgcacttggc ctggcacgct accgcctcat cgtgcacccg 360 

•* : :.- ; a g gctcgcggcc gccgcctgtg ctcgtgctca ccgccgtgtg ggccgcggcg 4 20 

ggactgctqg gcgcgctctc cctgctcggc ccgccgcccg caccgccccc tgctcctgct 4 80 

• : • • : ; • : : : tcctggctgg gggcctcggg cccttccggc cgctctgggc cctgctggcc 540 
• • oogccctcct gctgctcggc gcctacggcg gcatcttcgt ggtggcgcgt 600 
ogogongcoc tqaggccccc acggccggcg cgcgggtccc gactccgctc ggactctctg 660 
gaiagccgcc tttccatctt gccgccgctc cggcctcgcc tgcccggggg caaggcggcc 720 
or ggccccag cgctggccgt gggccaattt gcagcctgct ggctgcctta tggctgcgcg 780 
-gcctggogc cogcagcgcg ggccgcggaa gccgaagcgg ctgtcacctg ggtcgcctac 84 0 
icggccttcg oggctcaccc cttcctgtac gggctgctgc agcgccccgt gcgcttggca 900 
o- -jggccgcc totctcgccg tgcactgcct ggacctgtgc gggcctgcac tccgcaagcc 960 

" yy:icccqc gggcactctt gcaatgcctc cagagacccc cagagggccc tgccgtaggc 1020 

• - •• : : v] ctccagaaca gacccccgag ttoggcaggag ggcggagccc cgcataccag 1080 
Tgooacctg agagttctct ctcctga 1107 



2 I 0 ;> n 

211.* 368 

2 12> PRT 

2 13 > Homo sapiens 
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'" : I y Leu lie Leu 
20 

Gly Ala Leu Leu 



Leu Tyr Leu A^a 

5 0 

! e Mot Pro Leu 

0 3 

V . 1 Arq Leu Gly 



A. a Leu Leu Pro 

100 

Arg Tyr Arg Leu 
115 

Fro Val Leu Val 



Ala Leu 3er Leu 
14 5 

Arq Cys Ser Val 



Ala Leu Leu Ala 

130 

Ay Gly lie Phe 

195 

Pro Ala Arg Gly 



Ser lie Leu Fro 



Leu Ala Fro Ala 




Thr Gly Leu Asn 

5 

Ala Ala Val Val 



Val Val Val Leu 
40 

His Leu Cys Val 
55 

Gly Leu Leu Ala 

70 

Pro Ala Pro Cys 

35 

Ala Cys Thr Leu 



lie Val His Pro 
120 

Leu Thr Ala Val 
135 

Leu Gly Pro Pro 
150 

Leu Ala Gly Gly 
165 

Phe Ala Leu Pro 



Val Val Ala Arg 

200 

Ser Arg Leu Arg 

215 

Pro Leu Arg Pro 

230 

Leu Ala Val Gly 

245 



Ala Ser Glu Val 

10 

Glu Val Gly Ala 

25 

Arg Thr Pro Gly 



Val Asp Leu Leu 
60 

Ala Pro Pro Pro 

75 

Arg Ala Ala Arg 

9 0 

Gly Val Ala Ala 
105 

Leu Arg Pro Gly 



Trp Ala Ala Ala 

140 

Pro Ala Pro Pro 
155 

Leu Gly Pro Phe 
170 

Ala Leu Leu Leu 
185 

Arg Ala Ala Leu 



Ser Asp Ser Leu 
220 

Arg Leu Pro Gly 
235 

Gin Phe Ala Ala 

250 




Ala Gly Ser Leu 
15 

Leu Leu Gly Asn 

30 

Leu Arg Asp Ala 
45 

Ala Ala Ala Ser 



Gly Leu Gly Arg 

80 

Phe Leu Ser Ala 
95 

Leu Gly Leu Ala 
110 

Ser Arg Pro Pro 

125 

Gly Leu Leu Gly 



Pro Ala Pro Ala 
160 

Arg Pro Leu Trp 
175 

Leu Gly Ala Tyr 

190 

Arg Pro Pro Arg 

205 

Asp Ser Arg Leu 



Gly Lys Ala Ala 

240 

Cys Trp Leu Pro 
255 
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Tyr Gly Cys Ala Cys Leu Ala Pro Ala Ala Arg Ala Ala Glu Ala Glu 
260 265 270 

Ala Ala Val Thr Trp Val Ala Tyr Ser Ala Phe Ala Ala His Pro Phe 

2^5 280 285 

Leu Tyr Gly leu Leu Gin Arg Pro Val Arg Leu Ala Leu Gly Arg Leu 

2 10 2 95 300 

T-rr Arg .Arg Ala Leu Fro Gly Pro Val Arg Ala Cys Thr Pro Gin Ala 

^-'5 310 315 32 0 

Trp His Pre Arg Ala Leu Leu Gin Cys Leu Gin Arg Pro Pro Glu Gly 
325 330 335 

^ Pro Ala Val Gly Pro Ser Glu Ala Pro Glu Gin Thr Pro Glu Leu Ala 

34 0 34 5 35 0 



ily Gly Arg 3er Pro Ala Tyr Gin Gly Pro Pro Glu Ser Ser Leu Ser 
355 360 365 



<2il> 1003 
■:2L2> DMA 



-■■ 2 1 3 > 


Ho.^o 


sapiens 












■■ -1 n n 
*\ gga 


atoat 


ct t tctcatt 


t ggagtgatc 


ettgetgt cc 


tggect ccct 


catcat tget 


60 


« o t a a 


cacac 


tagt ggct gt 


ggctgt gctg 


ctgttgat cc 


acaagaat ga 


tggtgt cagt 


120 


ctctg 


o 1 1 o a 


ccttgaatct 


ggct gtggct 


gacacctt ga 


t tggt gtggc 


catctctggc 


180 


..:ract 


oacag 


accagct ct c 


cagccctt ct 


cggcccacac 


agaagaccct 


gtgcagcctg 


240 


: gat 


ggcat 


ttgtoacttc 


ct ccgcagct 


gectet gt cc 


t caeggtcat 


get gatcacc 


300 


*- +" - ^ .3 


oaggt 


acct t gecat 


caagcagccc 


t tccgctact 


tgaagat cat 


gagtgggttc 


3 60 


:* = } ' 


egggg 


ectgea tt gc 


eggget gtgg 


ttagtgtctt 


acctcattgg 


ct tcctccca 


420 


:\ :gg 


a a t c e 


ccatgt tcca 


gcagactgcc 


t acaaagggc 


agt gcagctt 


ct ttgctgta 


480 






actt cgtgct 


gaccct ct cc 


t gcgtt ggct 


t ct tcccagc 


catgctcctc 


540 


tttgt 


cttct 


tct actgega 


catget caag 


attgect cca 


tgcacagcca 


gcagatt cga 


600 


a a gat 


ggaac 


atgeaggage 


catggctgga 


ggt tatcgat 


ccccacggac 


tcccagcgac 


660 


' c :aaa get c 


tec gtactgt 


gtctgttctc 


at tgggagct 


t tget ct ate 


ct ggaccccc 


720 




t a t c a 


ct ggcat tgt 


gcaggt ggee 


t gecaggagt 


gtcacct eta 


cct agtgctg 


780 


g a a e g 


g t a c c 


tgtggctgct 


cggcgtgggc 


aactccct gc 


t caacccact 


cat ct at gec 


840 


tat t g 


gcaga 


aggaggtgcg 


actgcagctc 


taccacat gg 


ccctaggagt 


gaagaaggt g 


900 




t o a t 


tcctcctctt 


t ctctcggcc 


aggaat tgtg 


geccagagag 


geccagggaa 


960 
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m 



i - -r. : r • i : tatctccagc tcagagtttg atggctaa 



IOCS 



2H> 3 3 5 

:".;;:> prt 

,H 3 > Hcr.o sapiens 



Met : 3iu Ser Ser Fhe Ser ?he Gly Val He Leu Ala Val Leu Ala Ser 

5 10 15 

Leu He He Ala Thr Asn Thr Leu Val Ala Val Ala Val Leu Leu Leu 

20 25 30 

lie His Lys Asn Asp Gly Val Ser Leu Cys Phe Thr Leu Asn Leu Ala 
3 5 4 0 4 5 

Val Ala Asp Thr Leu He Gly Val Ala He Ser Gly Leu Leu Thr Asp 

50 55 60 



;ln Leu Ger Ger 



Fro Ser Arg Pro Thr Gin Lys Thr Leu Cys Ser Leu 

70 75 80 



Arg Met Aia Fhe Val Thr Ser Ser Ala Ala Ala Ser Val Leu Thr Val 
35 90 95 

Met. Leu He Thr Phe Asp Arg Tyr Leu Ala He Lys Gin Pro Phe Arg 
100 105 110 

Tyr Leu Lys He Met Ser Gly Phe Val Ala Gly Ala Cys He Ala Gly 
115 120 125 

Leu Trp Leu Val Ser Tyr Leu He Gly Phe Leu Pro Leu Gly He Pro 
130 135 14 0 

Met Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cys Ser Phe Phe Ala Val 

145 150 155 160 

i-he His Pro His Phe Val Leu Thr Leu Ser Cys Val Gly Phe Phe Pro 
165 170 175 

Ala Men Leu Leu Phe Val Phe Phe Tyr Cys Asp Met Leu Lys He Ala 
180 185 190 



; r Met His Ser Gin Gin He Arg Lys Met Glu His Ala Gly Ala Met 

195 2 00 20 5 
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i -Ly ;; 1 y ryr Arg Ser Pro Arg Thr Fro Ser Asp Fhe Lys Ala Leu 

:;o 215 22 0 

■ } Thr Val Ser Val Leu lie Giy Ser ?he Ala Leu Ser Trp Thr Pro 

5 230 235 240 

.e Leu lie Thr Gly lie Val Gin Val Ala Cys Gin Glu Cys His Leu 
245 250 255 

r Leu Val Leu Glu Arg Tyr Leu Trp Leu Leu Gly Val Gly Asn Ser 

260 265 270 

u Leu Asn Pro Leu He Tyr Ala Tyr Trp Gin Lys Glu Val Arg Leu 
2 H 23 0 285~ 

r. Leu Tyr His Met Ala Leu Gly Val Lys Lys Val Leu Thr Ser Phe 

290 295 300 

u Leu Phe Leu Ser Ala Arg Asn Cys Gly Pro Glu Arg Pro Arg Glu 

■j 3 10 3 15 320 

r Lor Gys Mis lie Val Thr He Ser Ser Ser Glu Phe Asp Gly 

325 330 335 



. 2 10 > 9 

■■ 2 1 1 > 14 13 

■:212> LNA 

■'21 3 > Homo sapiens 
'. 4 0 0 > 9 

:* r: ; - ; ;• .i ccatggaagc tgacctgggt gccactggcc acaggccccg cacagagctt 60 

.jatgatgagg actcctaccc ccaaggtggc tgggacacgg tcttcctggt ggccctgctg 120 

.: H .ccttgggc tgccagccaa tgggttgatg gcgtggctgg ccggctccca ggcccggcat 180 

:jgagctggca cgcgtctggc gctgctcctg ctcagcctgg ccctctctga cttcttgttc 240 

itggcagcag cggccttcca gatcctagag atccggcatg ggggacactg gccgctgggg 300 

i .-agctgcct gccgcttcta ctacttccta tggggcgtgt cctactcctc cggcctcttc 360 

:t-gctqgccg ccctcagcct cgaccgctgc ctgctggcgc tgtgcccaca ctggtaccct 420 

; : joaocgcc cagtccgcct gcccctctgg gtctgcgccg gtgtctgggt gctggccaca 4 80 

-r.t'rag;;g r.gccctggct ggtcttcccc gaggctgccg tctggtggta cgacctggtc 540 

a- gccl gg acttctggga cagcgaggag ctgtcgctga ggatgctgga ggtcctgggg 600 

- .-* et t t cct cct gctgctcgtc tgccacgtgc tcacccaggc cacagcctgt 660 

• t ~ i • ' i ■ : accgccaaca gcagcccgca gcctgccggg gcttcgcccg tgtggccagg 720 

ic-iUctgt cagcctatgt ggtcctgagg ctgccctacc agctggccca gctgctctac 780 

.rtjgccttcc tgtgggacgt ctactctggc tacctgctct gggaggccct ggtctactcc 840 

^rtacctga tcctactcaa cagctgcctc agccccttcc tctgcctcat ggccagtgcc 900 

jac^.ccgga ccctgctgcg ctccgtgctc tcgtccttcg cggcagctct ctgcgaggag 960 

: Tg^gggca gcttcacgcc cactgagcca cagacccagc tagattctga gggtccaact 1020 
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", r;jqaj.; j^atggcaga ggcccagtca cagatggatc ctgtggccca gcctcaggtg 1C30 

■. : ■ ■ .1 : tccaqccacg atcggatccc acagctcagc cacagctgaa ccctacggcc 1140 

::aqccacagt cggatcccac agcccagcca cagctgaacc tcatggccca gccacagtca 1200 

jattctgtgg cccagccaca ggcagacact aacgtccaga cccctgcacc tgctgccagt 1260 

t-itgtgccca gtccctgtga tgaagcttcc ccaaccccat cctcgcatcc taccccaggg 1320 

jr. t t :^agg acccagccac acctcctgcc tctgaaggag aaagccccag cagcaccccg 1380 

'■jajaggcgg ccccgggcgc aggccccacg tga 1413 



2 1Q> 10 
211> 4 68 
\212> FRT 

■ 2 1 .5 > Homo sapiens 



Met Asp Thr Thr Met Glu Ala Asp Leu Gly Ala Thr Gly His Arg Pro 
15 10 15 

Arg Thr Glu Leu Asp Asp Glu Asp 5er Tyr Pro Gin Gly Gly Trp Asp 

20 25 30 

LLr- Val Phe Leu Val Ala Leu Leu Leu Leu Gly Leu Pro Ala Asn Gly 
3 5 4 0 4 5 

Leu Met Ala Trp Leu Ala Gly Ser Gin Ala Arg His Gly Ala Gly Thr 
50 55 60 

Arg Leu Ala Leu Leu Leu Leu Ser Leu Ala Leu Ser Asp Phe Leu Phe 

65 70 75 80 

Lou Ala Ala Ala Ala Phe Gin lie Leu Glu He Arg His Gly Gly His 
85 90 95 

Trp Pro Leu Gly Thr Ala Ala Cys Arg Phe Tyr Tyr Phe Leu Trp Gly 
100 105 110 

Val Ser Tyr Ser Ser Gly Leu Phe Leu Leu Ala Ala Leu Ser Leu Asp 
115 120 125 

Arg Cys Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly His Arg Pro 
13 0 13 5 14 0 

'Ail Arg Leu Pro Leu Trp Val Cys Ala Gly Val Trp Val Leu Ala Thr 
14 5 150 15 5 160 

Leu Phe Ser Val Pro Trp Leu Val Phe Pro Glu Ala Ala Val Trp Trp 
165 170 175 
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: Asp L-^u Va 1 lie Cys leu Asp ?he i rp Asp Ser Glu Glu Leu Ser 

130 135 ■ 190 

u Arg Met Leu Glu Val Leu Gly Gly Phe Leu Pro Phe Leu Leu Leu 
195 200 205 

u Val Cys His Val Leu Thr Gin Ala Thr Arg Thr Cys His Arg Gin 

210 215 220 

n Gin Pro Ala Ala Cys Arg Gly Phe Ala Arg Val Ala Arg Thr lie 

5 230 235 240 

u Ser Ala Tyr Val Val Leu Arg Leu Pro Tyr Gin Leu Ala Gin Leu 
245 250 255 

u Tyr Leu Ala Phe Leu Trp Asp Val Tyr Ser Giy Tyr Leu Leu Trp 

260 265 270 

u Ala Leu Val Tyr Ser Asp Tyr Leu lie Leu Leu Asn Ser Cys Leu 

2 7 .5 280 285 

Pro Phe Leu Cys Leu Met Ala Ser Ala Asp Leu Arg Thr Leu Leu 

2^0 2 95 300 

g Sor Val Leu Ser Ser Phe Ala Ala Ala Leu Cys Glu Glu Arg Pro 

310 315 320 

y Ser Phe Thr Pro Thr Glu Pro Gin Thr Gin Leu Asp Ser Glu Gly 
325 330 335 

0 Thr Leu Pro Glu Pro Met Ala Glu Ala Gin Ser Gin Met Asp Pro 

340 345 350 

1 Ala Gin Pro Gin Val Asn Pro Thr Leu Gin Pro Arg Ser Asp Pro 

355 360 365 

r f-v i a G In Pro Gin Leu Asn Pro Thr Ala Gin Pro Gin Ser Asp Pro 

3^0 375 380 

r Ala Gin Pro Gin Leu Asn Leu Met Ala Gin Pro Gin Ser Asp Ser 

', 390 395 4 00 

1 Ala Gin Pro Gin Ala Asp Thr Asn Val Gin Thr Pro Ala Pro Ala 
405 410 415 

.a Ser Ser Val Pro Ser Pro Cys Asp Glu Ala Ser Pro Thr Pro Ser 
420 425 430 
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-is Pro Thr Pro Giy Ala Leu Glu Asp Pro Ala Thr Pro Pro Ala 
4 3 5 4 4 0 4 4 5 



'er Glu Gly Glu 5er Pro Ser Ser T; 
450 455 



Pro Pro Glu Ala Ala Pro Gly 
460 



•.la Gly Pro Thr 
165 



< 2 1 0 > 11 
<211> 1243 
<212> DNA 

<213> Mono sapiens 
<4 00> 11 

atgtcaggga tggaaaaact tcagaatgct tcctggatct accagcagaa actagaagat 60 

/\ ^ ccattccaga aacacctgaa cagcaccgag gagtatctgg ccttcctctg cggacctcgg 120 

;gcagccact tcttcctccc cgtgtctgtg gtgtatgtgc caatttttgt ggtgggggtc 180 

« attggcaatg tcctggtgtg cctggtgatt ctgcagcacc aggctatgaa gacgcccacc 240 

,■■ ■ * aactactacc tcttcagcct ggcggtctct gacctcctgg tcctgctcct tggaatgccc 300 

/ ^ -tggaggtct atgagatgtg gcgcaactac cctttcttgt tcgggcccgt gggctgctac 360 

V_yV ttcaagacgg ccctctttga gaccgtgtgc ttcgcctcca tcctcagcat caccaccgtc 420 

a gcgtggagc gctacgtggc catcctacac ccgttccgcg ccaaactgca gagcacccgg 480 

cgccgggccc tcaggatcct cggcatcgtc tggggcttct ccgtgctctt ctccctgccc 540 

aacaccagca tccatggcat caagttccac tacttcccca atgggtccct ggtcccaggt 600 

\cggccacct gtacggtcat caagcccatg tggatctaca atttcatcat ccaggtcacc 660 

tccttcctat tctacctcct ccccatgact gtcatcagtg tcctctacta cctcatggca 720 

ctcagactaa agaaagacaa atctcttgag gcagatgaag ggaatgcaaa tattcaaaga 780 

ccctgcagaa aatcagtcaa caagatgctg tttgtcttgg tcttagtgtt tgctatctgt 840 

tgggccccgt tccacattga ccgactcttc ttcagctttg tggaggagtg gagtgaatcc 900 

■orggctgctg tgttcaacct cgtccatgtg gtgtcaggtg tcttcttcta cctgagctca 960 

gctgtcaacc ccattatcta taacctactg tctcgccgct tccaggcagc attccagaat 1020 

: - gtgatctctt ctttccacaa acagtggcac tcccagcatg acccacagtt gccacctgcc 1080 

cagcggaaca tcttcctgac agaatgccac tttgtggagc tgaccgaaga tataggtccc 1140 

caattcccat gtcagtcatc catgcacaac tctcacctcc caacagccct ctctagtgaa 1200 

cagatgtcaa gaacaaacta tcaaagcttc cactttaaca aaacctga 1248 



<210> 12 
<2 11> 4 15 
<?.12> PRT 

'■'21 3> Elomo sapiens 
•■4C0> 12 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 
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Lys Leu ;iu Asp Pro Fhe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala ?he Leu Cys Gly Fro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
5 0 5 5 60 

Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 

o5 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
ICQ 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 

115 12 0 12 5 

Val Cys Phe Ala Ser lie Leu Ser He Thr Thr Val Ser Val Glu Arg 
130 135 14 0 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arg Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 
165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
130 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 

195 200 205 

Fro Met Trp He Tyr Asn Phe lie He Gin Val Thr Ser Phe Leu Phe 

210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
22 5 2 30 23 5 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 
245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 

260 265 270 
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Val Leu Val Phe Ala lie Cys Trp Ala Pre rhe His lie Asp Arg 

275 230 285 

Leu Phe ?he Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 

290 295 300 

rhe Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser 
JC5 310 315 320 

Ala Val Asn Pro He lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 
325 330 335 

Ala Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 ~350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu 
355 360 365 

Vys His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys 

3 7 0 37 5 38 0 



i 3 5 



Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
390 395 400 



Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 



<210> 13 

< 2 1 1 > 1173 

<2 12> DNA 

<213> Homo sapiens 

< 4 00> 13 

atgecagata ctaatagcac aatcaattta tcactaagca ctcgtgttac tttagcattt 60 
tttatgtcct tagtagcttt tgctataatg ctaggaaatg ctttggtcat tttagctttt 120 
gtggtggaca aaaaccttag acatcgaagt agttattttt ttcttaactt ggccatctct 180 
qacttctttg tgggtgtgat ctccattcct ttgtacatcc ctcacacgct gttcgaatgg 240 
■lattttggaa aggaaatctg tgtattttgg ctcactactg actatctgtt atgtacagca 300 
- :;- gtat ata acattgtcct catcagctat gatcgatacc tgtcagtctc aaatgctgtg 360 
tot tatagaa ctcaacatac tggggtcttg aagattgtta ctctgatggt ggccgtttgg 420 
gtgctggcct tcttagtgaa tgggccaatg attctagttt cagagtcttg gaaggatgaa 480 
.j -jtagtgaat gtgaacctgg atttttttcg gaatggtaca tccttgccat cacatcattc 540 
ttggaatteg tgatcccagt catcttagtc gcttatttca acatgaatat ttattggagc 600 
otgtggaagc gtgatcatct cagtaggtgc caaagccatc ctggactgac tgctgtctct 660 
tocaacatct gtggacactc attcagaggt agactatctt caaggagatc tetttctgea 720 
tcgacagaag ttcctgcatc ctttcattca gagagacaga ggagaaagag tagtctcatg 780 
ttttsotcaa gaaccaagat gaatagcaat acaattgett ccaaaatggg ttccttctcc S40 
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:aat ^agat C rtgtagctct tcaccaaagg gaacatgttg aactgcttag agccaggaga 900 
ttagccaagt cactggccat tctcttaggg gtttttgctg tttgctgggc tccatattct 960 
■jtgttcacaa ttgtcctttc attttattcc tcagcaacag gtcctaaatc agtttggtat 1020 
agaattgcat tttggcttca gtggttcaat tcctttgtca atcctctttt gtatccattg 1080 
tgtcacaagc gctttcaaaa ggctttcttg aaaatatttt gtataaaaaa gcaacctcta 1140 
:aac acagtcggtc agtatcttct taa 1173 



■.210 14 
-211 > 390 
■■■ 2 1 2 > PRT 

<213> Hcrr.o sapiens 
4 00> 14 

Met Pro Asp Thr Asn Ser Thr lie Asn Leu Ser Leu Ser Thr Arg Val 

5 10 15 

Thr Leu Ala Phe Phe Met Ser Leu Val Ala Phe Ala lie Met Leu Gly 

20 25 30 

Asn Ala Leu Val lie Leu Ala Phe Val Val Asp Lys Asn Leu Arg His 

3 r ; 4 0 4 5 

Arg Ser Ser Tyr Phe Phe Leu Asn Leu Ala lie Ser Asp Phe Phe Val 
5 0 5 5 6 0 

Sly Val lie Ser lie Pro Leu Tyr lie Pro His Thr Leu Phe Glu Trp 
65 70 75 80 

Asp Phe Gly Lys Glu lie Cys Val Phe Trp Leu Thr Thr Asp Tyr Leu 
85 90 95 

Leu Cys Thr Ala Ser Val Tyr Asn lie Val Leu lie Ser Tyr Asp Arg 
100 105 110 

Tyr Leu Ser Val Ser Asn Ala Val Ser Tyr Arg Thr Gin His Thr Gly 
115 120 125 

'.■hi Leu Lys lie Val Thr Leu Met Val Ala Val Trp Val Leu Ala Phe 
130 135 140 

Leu Val Asn Gly Pro Met lie Leu Val Ser Glu Ser Trp Lys Asp Glu 
145 150 155 160 

Gly Ser Glu Cys Glu Pro Gly Phe Phe Ser Glu Trp Tyr lie Leu Ala 
165 170 175 

Tie Thr Ser Phe Leu Glu Phe Val He Pro Val He Leu Val Ala Tyr 
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15 0 13 5 190 

Phs Asn Met Asn He Tyr Trp Ser Leu Trp Lys Arg Asp His Leu Ser 

195 200 205 

Arg Cys Gin Ser His Pro Gly Leu Thr Ala Val Ser Ser Asn He Cys 
2 10 215 220 

Gly His Ser Phe Arg Gly Arg Leu Ser Ser Arg Arg Ser Leu Ser Ala 

22 0 2 30 235 24 0 

Jer Thr Glu Val Pro Ala Ser Phe His Ser Glu Arg Gin Arg Arg Lys 
245 250 255 

Ser Ser Leu Met Phe Ser Ser Arg Thr Lys Met Asn Ser Asn Thr He 

260 265 270 

Ala Ser Lys Met Gly Ser Phe Ser Gin Ser Asp Ser Val Ala Leu His 

275 280 235 

Gin Arg Glu His Val Glu Leu Leu Arg Ala Arg Arg Leu Ala Lys Ser 

2 90 2 95 300 

Leu Ala He Leu Leu Gly Val Phe Ala Val Cys Trp Ala Pro Tyr Ser 
H5 310 315 320 

Leu Phe Thr He Val Leu Ser Phe Tyr Ser Ser Ala Thr Gly Pro Lys 
325 330 335 

Ser Val Trp Tyr Arg He Ala Phe Trp Leu Gin Trp Phe Asn Ser Phe 

340 345 350 

V a 1 A s n Pro Leu Leu Tyr Pro Leu Cys His Lys Arg Phe Gin Lys Ala 
355 360 365 

Phe Leu Lys He Phe Cys lie Lys Lys Gin Pro Leu Pro Ser Gin His 

370 375 380 

Sor Arg Ser Val Ser Ser 

■:o5 3 90 



■:2 10> 15 

--211> 112 3 

-■212 > 2>NA 

<213> Homo sapiens 

■ . 4 0 0 > 15 
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• 


• 


• 


• 


• 




i - i ) -- -J -J - 7» 


■■ : q a q c q a a c c 


■a qq t qqca qc 


qqcqqcqccq 


aqqcqqccqc 


c c t a a a c c t c 


60 






gctgctgtgc 


gt gagectag 


egggcaacg t 


gctgt t cacg 


120 


" " a c f" aa t ca 


t acaaaaqca 


cagect gcac 


cgcgccccg t 


ac tacct get 


get cgacct g 


180 


- - ^ - - ^) 


l_i ^- -^J -J 


s - — ■ ^ ^ h — ■■ — ■ ■— j 


tacct crna 

^ l_ V_ ^ -^J 


cntcatact 




24 0 


<~i c a a c a a 




ggcgccgccg 




getgeaaget 


octcacrttr 


300 


■ ^ t h r a a 


t ctt ctgctt 


ccacgcccicc 






cat ca cr^nc 


360 


■■ ■* - - - : J J L - a 


tcgcg caeca 


ccget t ct at 


gcagagcgcc 


tggccggctg 


arcntococr' 


420 






craacac ) 'ci 


gege t ggccg 


cggccttccc 


GCCciciVnctci 

M "—A y U M \J 


4 80 


1 a "• : a c a a ^ a 


a c q a c g a c g a 


cj a a c a c a c c a 


tocaccctaa 


agcagcggcc 




54 0 


. " c ; " a a r a ( " a c 


t q g g c 1 1 c c t 


act qctqct q 


qccqtaatqq 


t gggcgccac 


gcacct cgt c 


600 




t g ctcttctt 


catccacgac 


cgccgcaaga 


tgcggcccgc 


acocct. aat a 


660 




g c c a c cj a c t g 


gacc 1 1 ccac 


qqcccaqaca 


ccaccggcca 


aqcqqccqcc 


720 


.3 <_i ... ^ ^ a y -y 


a a a c ^ i" " a a 


ccacaaaccc 


acgccgcccg 


ca c 1 1 at aaa 


cat ccggccc 


780 


a a a o c c a a 




gcgccgcctc 


c t cqtqctqq 


aagaat t caa 


gaeggagaag 


84 0 


i 'j c *~ 1 1 cr ^ a 


agatgttcta 


cgccatcacg 


ctget ctt cc 


tgctcctctg 


ggggCCCtac 


900 


j — -j L -j \j 1 -■ a 


gctacctgcg 


aatcctaata 


cqqcccqqcq 


ccgt ccccca 


ggcctacctg 


960 




t n t n n c roar 


cttcgcgcag 


geeggcat ca 


accccgt cgt 


ntar^t rrt- r 

■^J - ^— ^- L_- \_- 


1020 




C3 '"i f~* T~ n ■'"i jT t 
d-.JL L y d , -J'Jya 










i n 9, n 




cga :c::atcc 


ct gcgacctg 


aaaggcattg 


gt ttatga 




1128 



2 1 0 > 16 

^■21 1> 37 5 

■,.212> PRT 

21 3> Homo sapiens 

■■"4 0 7i> 16 

Met Ala Asn Ala Ser Glu Pro Gly Gly Ser Gly Gly Gly Glu Ala Ala 
15 10 15 

Ala Leu Gly Leu Lys Leu Ala Thr Leu Ser Leu Leu Leu Cys Val Ser 

20 25 30 

Leu Ala Gly Asn Val Leu Phe Ala Leu Leu lie Val Arg Glu Arg Ser 
3 5 4 0 4 5 

Leu His Arg Ala Pro Tyr Tyr Leu Leu Leu Asp Leu Cys Leu Ala Asp 

50 55 60 

giy Leu Arg Ala Leu Ala Cys Leu Pro Ala Val Met Leu Ala Ala Arg 

o5 70 75 80 

Arg Ala Ala Ala Ala Ala Gly Ala Pro Pro Gly Ala Leu Gly Cys Lys 
85 90 95 

Leu Leu Ala Phe Leu Ala Ala Leu Phe Cys Phe His Ala Ala Phe Leu 

100 105 110 
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Leu Gly Val oiy Val Thr Arg Tyr Leu Ala lie Ala His His Arg 
115 120 125 

"he Tyr Ala Glu Arg Leu Ala Gly Trp Pro Cys Ala Ala Met Leu Val 
130 135 140 



Cys Ala Ala Trp Ala Leu Ala Leu Ala Ala Ala Phe Pro Pro Val Leu 
14 5 150 15 5 160 

Asp Gly Gly Gly Asp Asp Glu Asp Ala Pro Cys Ala Leu Glu Gin Arg 
16 5 17 0 17 5 

Pro Asp Gly Ala Pro Gly Ala Leu Gly Phe Leu Leu Leu Leu Ala Val 
130 135 190 

Val Val Gly Ala Thr His Leu Val Tyr Leu Arg Leu Leu Phe Phe He 

195 2 00 2 05 

His Asp Arg Arg Lys Met Arg Pro Ala Arg Leu Val Pro Ala Val Ser 

210 215 220 

■[.:■; Asp Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin Ala Ala Ala 
225 230 235 240 

Asr. Trp Thr Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Ala Leu Val 
245 250 255 

Gly He Arg Pro Ala Gly Pro Gly Arg Gly Ala Arg Arg Leu Leu Val 
260 265 270 

Leu Glu Glu Phe Lys Thr Glu Lys Arg Leu Cys Lys Met Phe Tyr Ala 
275 280 285 

Val Thr Leu Leu Phe Leu Leu Leu Trp Gly Pro Tyr Val Val Ala Ser 
290 295 300 

Tyr Leu Arg Val Leu Val Arg Pro Gly Ala Val Pro Gin Ala Tyr Leu 

305 310 315 320 

Thr Ala Ser Val Trp Leu Thr Phe Ala Gin Ala Gly Tie Asn Pro Val 
325 330 335 

Vii Cys Phe Leu Phe Asn Arg Glu Leu Arg Asp Cys Phe Arg Ala Gin 
340 345 350 

Phe Pro Cys Cys Gin Ser Pro Arg Thr Thr Gin Ala Thr His Pro Cys 

355 360 365 
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1 y 1 e u 




5 7 0 




375 




1 0 > 1 7 








1H 1002 








:;.:> dna 








13> Hcrr.o 


sapiens 




.-■ 

a t 


CC> 1 7 
g a a c a ct c a 


c a g t g a t g c a 


agger: t caac 




a g t a c a g c 


tggtat tccc 


ageect ctac 


i d 


tact 1 1 gq 


ctctgtgggt 


gtttgrtcac 




c a a a a a c a 


ct ttggtggc 


cgacttgata 




'.qactcac 


acctggcacc 


ctggcagctc 


itattttatg 


agacca tgta 


tgtgggcat c 


r t 


c ctcaaga 


tea tcagacc 


ttt gaga aat 


d c 


ggt ctcaa 


tct tcatctg 


gttctttttg 


ag 


c a a e a a g g 


aagcaacacc 


a t c g t c t g t g 


■J J 


g e t. g a a a t 


ggcatcaaat 


ggtaaa taac 


at 


cct aatgc 


ttgtgtttta 


tgtggttatt 




.vaaaagt. a 


agqacagaaa 


aaacaacaaa 






ttgtgtgttt 


tgctccattt 


: d d d C t d d C d 


at aagact ga 


ct gt agactg 


d c 


tct ctttt 


t ggcagcaac 


taacatt tgt 


aaaaaattca 


cagaaaagct 


accatgtatg 


g a 


a aar.ca:a 


gcagt cagac 


agacaacata 




agatctgagc 


ggtgccccag 


agacact egg 


60 


acagtggttt 


t ct tgacegg 


cat ectget g 


120 


a tccccagct 


cct ccacctt 


catcat ctac 


130 


a tgacact ca 




caaaatcct c 


240 


agagctt ttg 


tgtgtcgttt 


t tetteggtg 


300 


gt gctgtt ag 


ggctcatagc 


ctt tgacaga 


360 


atttttctaa 


aaaaacctgt 


t tt tgcaaaa 


420 


ttcttcatct 


ccctgccaaa 


t acgatcttg 


480 


aaaaagtgtg 


ctt cct taaa 


ggggect ct g 


540 


a tatgecagt 


ttattttctg 


gactgttttt 


600 


gcaaaaaaag 


tatatgatt c 


t tatagaaag 


660 


aagctggaag 


gcaaagtat t 


t gt tgtcgtg 


720 


cat ttt geca 


gagtt cca ta 


t ac tcacagt 


780 


caaaat caac 


tgtttattgc 


t aaagaaaca 


840 


atggat ccct 


taatatacat 


att ct tatgt 


900 


caagggagaa 


agaccacagc 


ateaagecaa 


960 


acct taggct 


ga 




1002 



< 2 1 0 > 18 
Hli> 3 33 
■■212> P RT 

■■'213> Homo sapiens 
H00> 18 

M«t Asn Thr Thr Vai Met Gin Gly Phe Asn Arg Ser Glu Arg Cys Pro 
1 5 10 15 

Arg Asp Thr Arg lie Vai Gin Leu Vai Phe Pro Ala Leu Tyr Thr Vai 

20 25 30 

Vai Phe Leu Thr Gly He Leu Leu Asn Thr Leu Ala Leu Trp Vai Phe 
35 40 45 

Vai His He Pro Ser Ser Ser Thr Phe He He Tyr Leu Lys Asn Thr 

50 55 60 

Hu Vai Ala Asp Leu lie Met Thr Leu Met Leu Pro Phe Lys He Leu 
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H H 7 5 SO 

Ser Asp Ser His Leu Ala Pro Trp Gin Leu Arg Ala Fhe Val Cys Arg 
85 90 95 

The Ser Ser Val lie Fhe Tyr Glu Thr Met Tyr Val Gly lie Val Leu 
^ " 0 105 110 

Leu Gly Leu lie Ala ?he Asp Arg Phe Leu Lys lie lie Arg Pro Leu 
115 12 0 12 5 

Arg Asn lie Fhe Leu Lys Lys Pro Val Phe Ala Lys Thr Val Ser lie 
130 135 140 

Phe He Trp Fhe Fhe Leu Fhe Phe lie Ser Leu Pro Asn Thr He Leu 
145 150 155 160 

Ser Asn Lys Glu Ala Thr Fro Ser Ser Val Lys Lys Cys Ala Ser Leu 
165 170 175 

Lys Gly Fro Leu Gly Leu Lys Trp His Gin Met Val Asn Asn He Cys 
130 185 190 

Gin Fhe He Fhe Trp Thr Val Phe lie Leu Met Leu Val Phe Tyr Val 
195 200 205 

Val He Ala Lys Lys Val Tyr Asp Ser Tyr Arg Lys Ser Lys Ser Lys 
210 215 220 

Asp Arg Lys Asn Asn Lys Lys Leu Glu Gly Lys Val Phe Val Val Val 

22 5 230 235 240 

Ala Val Phe Phe Val Cys Phe Ala Pro Phe His Phe Ala Arg Val Pro 
245 250 255 

Tyr Thr His Ser Gin Thr Asn Asn Lys Thr Asp Cys Arg Leu Gin Asn 

260 265 270 

GH Leu Fhe He Ala Lys Glu Thr Thr Leu Phe Leu Ala Ala Thr Asn 
275 280 285 

He Cys Met Asp Pro Leu He Tyr He Phe Leu Cys Lys Lys Phe Thr 

290 295 300 

Glu Lys Leu Fro Cys Met Gin Gly Arg Lys Thr Thr Ala Ser Ser Gin 
^05 310 315 320 

Glu Asn His Ser Ser Gin Thr Asp Asn He Thr Leu Gly 
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33C 



•:213> 19 

< 2 1 1 > 1122 

■212 > 7NA 

■ ■ 2 1 3 > Hg.t.o sapiens 



•-: 4 0 0 > 
atggc 


1 9 

a a c a 


c t aceggaga 


gectgaggag 


gtgagcggcg 


ct ct gt cccc 


accgt ccgea 


60 




— atg 


tgaagctggt 


act gctggga 


ctgat tat gt 


gcgtgagcct 


ggegggtaac 


120 


jccat 


cttgt 


ccctgctggt 


get caaggag 


cgt geect gc 


acaaggctcc 


t tactactt c 


130 




a g a c c 


tgtgcctggc 


cgatggcat a 


cgctctgccg 


t ctgett ccc 


ctt tgtgct g 


240 




tgtgc 


gccacggct c 


t tcatggacc 


tt cagtgeae 


t cagctgeaa 


gat tgtggcc 


300 


t t t a t 


ggccg 


tgctcttttg 


ctt ecatgeg 


gect teatge 


tgt tetgeat 


cagegt cacc 


360 


- : ". t a 


catgg 


ccatcgccca 


ccaccgct tc 


tacgccaagc 


gcatgacact 


ctggacatgc 


420 


qcggc 


tgtca 


tetgeatgge 


ct ggaccct g 


t ct gtggcca 


t ggcett ccc 


acctgtctt t 


430 


gacgt 


gggca 


cctacaagtt 


tat tegggag 


gaggaccagt 


gcatctt t ga 


gcatcgctac 


540 


ucaa 


qgcca 


at gaeacget 


gggctt catg 


ctt atgttgg 


ctgtgct cat 


ggcagct acc 


600 


" ; ' ■•' - 


tgtct 


aeggcaaget 


gctectct tc 


gagtatcgt c 


acegcaagat 


gaagccagt g 


660 


o a gar- 


ggtgc 


cagccatcag 


ccagaactgg 


acatt ccatg 


gtcccggggc 


caccggccag 


720 


ret gc 


t g c c a 


act ggatege 


eggett tggc 


cgt gggecca 


tgccaccaac 


cct gctgggt 


780 




at gggca n gc 


agccagccgg 


eggctactgg 


gca tggacga 


ggt caagggt 


840 


i . j a a a 


qcaqc 


rgggcegcat 


gt tct aegeg 


atcacact gc 


tetttctget 


cct ct ggtca 


900 


e /.;ta 


catcg 


tggect gcta 


c tggcgagtg 


t ttgt gaaag 


cct gt get gt 


gccccaccgc 


960 




ggcca 


ct gc tgtt tg 


gatgagcttc 


gcccaggctg 


ccgt caaccc 


aat tgtctgc 


1020 




get ca 


acaaggacct 


caagaagt gc 


ctgaccactc 


acgccccctg 


ctggggcaca 


1080 


ggaggt gccc 


cggctcccag 


agaaccct ac 


t gtgt cat gt 


ga 




1122 



■-.-:io> 20 
■■2i :> 373 

2 1 2 > PRT 
<2:3> Homo sapiens 

•-4CO 20 

Met Ala Asn Thr Thr Gly Glu Pro Glu Glu Val Ser Gly Ala Leu Ser 

5 10 15 

Pro Pro Ser Ala Ser Ala Tyr Val Lys Leu Val Leu Leu Gly Leu lie 
20 25 30 

Mot Cys Val Ser Leu Ala Gly Asn Ala lie Leu Ser Leu Leu Val Leu 
35 40 45 

Lys Glu Arg Ala Leu His Lys Ala Pro Tyr Tyr Phe Leu Leu Asp Leu 

5 0 5 5 60 
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• 'ys L-u Ala Asp Gly lie Arg Ser Ala Val Cys ?he Fro ?he Val Leu 

6 5 70 75 80 

Ala Ser Vai Arg His Gly Ser Ser Trp Thr Phe Ser Ala Leu Ser Cys 
85 90 95 

Lys lie Val Ala Fhe Met Ala Val Leu Phe Cys Phe His Ala Ala Phe 
100 105 110 

Leu Fhe Cys lie Ser Vai Thr Arg Tyr Met Ala lie Ala His His 
115 120 125 

Arg Fhe Tyr Ala Lys Arg Met Thr Leu Trp Thr Cys Ala Ala Val He 
13C 135 140 

"ys Met Ala Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Phe 
14 5 150 155 160 

Asp Val Gly Thr Tyr Lys Phe He Arg Glu Glu Asp Gin Cys He Phe 

16 5 17 0 17 5 

Glu His Arg Tyr Fhe Lys Ala Asn Asp Thr Leu Gly Phe Met Leu Met 
180 135 190 

Leu Ala Val Leu Met Ala Ala Thr His Ala Val Tyr Gly Lys Leu Leu 
195 200 205 

Leu Fhe Glu Tyr Arg His Arg Lys Met Lys Pro Val Gin Met Val Pro 
210 215 220 

Ala lie Ser Gin Asn Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin 

225 230 235 240 

Ala Ala Ala Asn Trp He Ala Gly Phe Gly Arg Gly Pro Met Pro Pro 
245 250 255 

Thr Leu Leu Gly He Arg Gin Asn Gly His Ala Ala Ser Arg Arg Leu 

260 265 270 

Leu Gly Met Asp Giu Val Lys Gly Glu Lys Gin Leu Gly Arg Met Phe 
275 2S0 285 

Tyr Ala lie Thr Leu Leu Phe Leu Leu Leu Trp Ser Pro Tyr He Val 

290 295 300 

Ala Cys Tyr Trp Arg Val Phe Val Lys Ala Cys Ala Val Pro His Arg 

305 310 315 320 
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,t : Ala Thr Ala Val Trp Met 3er Phe Ala Gin Ala Ala Vai Asn 
325 330 335 

:o lie Val Cys Phe Leu Lea Asn Lys Asp Leu Lys Lys Cys Leu Thr 
340 345 350 

■r His Ala Fro Cys Trp Gly Thr Gly Gly Ala Pro Ala Pro Arg Glu 
355 360 365 

;o Tyr Cys Val Met 



■y 



■ 2 1 1 > 10 5 3 
< 2 1 2 > DMA 

■-"'21 3> Homo sapiens 

--■KJ0> 2 1 

atggctttgg aacagaacca gtcaacagat tattattatg aggaaaatga aatgaatggc 60 

/'"V uottatgact acagtcaata tgaattgatc tgtatcaaag aagatgtcag agaatttgca 120 

aaaqttttoc tccctgtatt cctcacaata gctttcgtca ttggacttgc aggcaattcc 130 

atqgtagtgg caatttatgc ctattacaag aaacagagaa ccaaaacaga tgtgtacatc 240 

:tgaatttgg ctgtagcaga tttactcctt ctattcactc tgcctttttg ggctgttaat 300 

g~agttcatg ggtgggtttt agggaaaata atgtgcaaaa taacttcagc cttgtacaca 360 

.^.aaactttg tctctggaat gcagtttctg gcttgcatca gcatagacag atatgtggca 420 

gtaaotaatg tccccagcca atcaggagtg ggaaaaccat gctggatcat ctgtttctgt 480 

gtctggatgg ctgccatctt gctgagcata ccccagctgg ttttttatac agtaaatgac 540 

aatgctaggt gcattcccat tttcccccgc tacctaggaa catcaatgaa agcattgatt 600 

■:aaatqctag agatctgcat tggatttgta gtaccctttc ttattatggg ggtgtgctac 660 

ittatcacgg caaggacact catgaagatg ccaaacatta aaatatctcg acccctaaaa 720 

jttcrrgctca cagtcgttat agttttcatt gtcactcaac tgccttataa cattgtcaag 780 

t t ct gccgag ccatagacat catctactcc ctgatcacca gctgcaacat gagcaaacgc 840 

a tggacat eg ccatccaagt cacagaaagc attgeactet ttcacagctg cctcaaccca 900 

utcctttatg tttttatggg agcatctttc aaaaactacg ttatgaaagt ggccaagaaa 960 

tatgggtcct ggagaagaca gagacaaagt gtggaggagt ttccttttga ttctgagggt 1020 

o:tacagagc caaccagtac ttttagcatt taa 1053 



<-211> 3 50 

• . : ,: • b.t 

■ ■ 2 1 3 > H c mo sapiens 



Met Ala Leu Glu Gin Asn Gin Ser Thr Asp Tyr Tyr Tyr Glu Glu Asn 
1 5 10 15 
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• • • • 



20 2 5 30 

Lys 31 u Asp Vai Arg Glu ?he Ala Lys Val Fhe Leu Fro Val Fhe Leu 
3 5 4 0 4 5 

Thr He Ala Phe Val lie Gly Leu Ala Gly Asn Ser Met Val Val Ala 

50 55 60 

lie Tyr Ala Tyr Tyr Lys Lys Gin Arg Thr Lys Thr Asp Val Tyr He 

65 70 75 80 

Leu As:: Leu Ala Val Ala Asp Leu Leu Leu Leu Phe Thr Leu Pro Phe 

35 90 95 

Trp Ala Val Asn Ala Val His Gly Trp Val Leu Gly Lys He Met Cys 
100 105 110 

Lys Lie Thr Ser Ala Leu Tyr Thr Leu Asn Phe Val Ser Gly Met Gin 
115 120 125 

Pho Leu Ala Cys He Ser He Asp Arg Tyr Val Ala Val Thr Asn Val 
150 135 140 

Pro Ser Gin Ser Gly Val Gly Lys Pro Cys Trp lie lie Cys Phe Cys 
145 150 155 160 

Val Trp Met Ala Ala lie Leu Leu Ser He Pro Gin Leu Val Phe Tyr 
165 170 175 

Thr Val Asn Asp Asn Ala Arg Cys He Pro He Phe Pro Arg Tyr Leu 
130 185 190 

Sly Thr Ser Met Lys Ala Leu He Gin Met Leu Glu He Cys He Gly 

195 200 205 

Phe Val Val Pro Phe Leu He Met Gly Val Cys Tyr Phe He Thr Ala 
2 10 215 2 2 0 

Arg Thr Leu Met Lys Met Pro Asn lie Lys lie Ser Arg Pro Leu Lys 

2.H 230 235 240 

Val Leu Leu Thr Val Val He Val Phe lie Val Thr Gin Leu Pro Tyr 
245 250 255 

Asn He Val Lys Phe Cys Arg Ala lie Asp lie lie Tyr Ser Leu lie 

260 265 270 
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Ger 3ys Asn Met ie: Lys Arg Met Asp lie Ala He Gin Val Thr 

2^5 230 285 

^u Ser lie Ala Leu Phe His Ser Cys Leu Asn Pro He Leu Tyr Val 

290 295 300 

he Met Gly Ala Ser Phe Lys Asn Tyr Val Met Lys Val Ala Lys Lys 

31C 315 320 

yr 21 y Ser Trp Arg Arg Gin Arg Gin Ser Val GIu Glu Phe Pro Phe 
325 330 335 



Giy Pro Thr Glu Pro Thr Ser Thr Phe Ser He 

34 0 34 5 350 



210> 23 

211> 1116 

Z. _ 2 > 2 N A 

2 1 3 Homo sap iens 



r. qccaggaa 


a c gccacccc 


agtgaccacc 


actgccccgt 


gggcctccct 


gggect ctcc 


60 


: .: a a g a c c t 


g c a a c a a c g t 


gt cct toga a 


gagagcagga 


tagtcct ggt 


cgtggt gtac 


120 


gcgcggtgt 


gcacgctggg 


ggt gccggcc 


aactgectga 


ct gcgtggct 


ggeget gctg 


180 


aggtact gc 


agggcaacgt 


get ggccgt c 


tacct gctct 


gcctggcact 


ctgegaact g 


240 


t. c r. a z a c a g 


gcacgct gcc 


act ct gggtc 


atctatat cc 


gcaaccagca 


ccgctggacc 


300 


taggcct gc 


t ggcctcgaa 


ggt gaccgcc 


tacatcttct 


tetgeaacat 


etaegt cage 


360 


tcctcttc c 


t gtgc tgcat 


ct cctgcgac 


cgett cgtgg 


ccgtggt gta 


cgcgctggag 


420 


gtcggggcc 


gccgccgccg 


gaggaccgcc 


at cct catct 


ccgcctgcat 


ct teat cct c 


480 


tcggqatog 


t tcactaccc 


ggt gt tccag 


aeggaagaca 


aggagacctg 


ct ttgacat g 


540 


-. goagatgg 


acagcaggat 


t geegggtae 


t act aegeca 


ggttcaccgt 


tggctt tgee 


600 


r. :cctctct 


ccatcat cgc 


ctt caccaac 


caceggattt 


tcaggagcat 


caagcagagc 


660 


tgggcttaa 


gcgct gccca 


gaaggccaag 


gt gaagcact 


eggecat cgc 


ggtggt tgtc 


720 


tcttcetag 


tctgcttcgc 


cccgtaccac 


ct ggt tctcc 


tegtcaaage 


cgctgcctt t 


780 


■z c t a c t a c a 


gaggagacag 


gaacgccatg 


tgegget tgg 


aggaaaggct 


gtacacagcc 


840 


ctgtggtgt 


t tctgtgcct 


gt ecaeggtg 


aacggcgtgg 


ctgaccccat 


tatctaegtg 


900 


t ggccacgg 


accat tcccg 


ccaagaagt g 


tccagaatcc 


at aaggggtg 


gaaagagtgg 


960 


cca tgaaga 


cagacgt cac 


caggctcacc 


cacagcaggg 


acaccgagga 


getgeagt eg 


1020 


::cgtggecc 


t tgcagacca 


ctacaccttc 


t ccaggcccg 


tgcacccacc 


agggtcacca 


108 0 


go cct gcaa 


agaggctgat 


tgaggagtcc 


tget ga 






1116 



2 13> 2 4 

2 11> 37 1 

212 > PRT 

21 3 > Hor.o sac iens 
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• • • • 



Xet Fro Gly Asn Ala Thr Pro Val Thr Thr Thr Ala Pro Trp Ala Ser 
15 10 15 

Leu Gly Leu Ser Ala Lys Thr Cys Asn Asn Val Ser Phe Glu Glu Ser 

20 25 30 

Arg lie Val Leu Val Val Val Tyr Ser Ala Val Cys Thr Leu Gly Val 
3 5 4 0 4 5 

Pro Ala Asn Cys Leu Thr Ala Trp Leu Ala Leu Leu Gin Val Leu Gin 

50 55 60 

Gly Asn Val Leu Ala Val Tyr Leu Leu Cys Leu Ala Leu Cys Glu Leu 
h5 7 0 7 5 8 0 

Leu Tyr Thr Gly Thr Leu Pro Leu Trp Val He Tyr He Arg Asn Gin 
35 90 95 

His Ar-j Trp Thr Leu Gly Leu Leu Ala Ser Lys Val Thr Ala Tyr He 
100 105 110 

Phe Phe Cys Asn He Tyr Val Ser He Leu Phe Leu Cys Cys He Ser 
H 5 12 0 12 5 

Cys Asp Arg Phe Val Ala Val Val Tyr Ala Leu Giu Ser Arg Gly Arg 

1 3 0 13 5 14 0 

Arg Arg Arg Arg Thr Ala He Leu He Ser Ala Cys He Phe He Leu 
145 150 155 160 

VH Gly He Val His Tyr Pro Val Phe Gin Thr Glu Asp Lys Glu Thr 
165 170 175 

Cys Phe Asp Met Leu Gin Met Asp Ser Arg He Ala Gly Tyr Tyr Tyr 
180 185 190 

Ala Arg Phe Thr Val Gly Phe Ala He Pro Leu Ser He He Ala Phe 

195 200 205 

Thr Asn His Arg He Phe Arg Ser He Lys Gin Ser Met Gly Leu Ser 

210 215 220 

Ala Ala Gin Lys Ala Lys Val Lys His Ser Ala He Ala Val Val Val 

025 230 235 240 

: ; •> Phe Leu Val Cys Phe Ala Pro Tyr His Leu Val Leu Leu Val Lys 
245 250 255 
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- -) 



\ia Aia Ala Fhe Ser iyr Tyr Arg Gly Asp Arg Asn Ala Met Cys Gly 

260 265 270 

_eu Glu Glu Arg Leu Tyr Thr Ala Ser Val Val Phe Leu Cys Leu Ser 

2^5 2SQ 285 

/hr Val Asn Gly Val Ala Asp Pro lie He Tyr Val Leu Ala Thr Asp 

290 295 300 

:L:3 Ser Arg Gin Glu Val Ser Arg He His Lys Gly Trp Lys Glu Trp 

;05 310 315 320 

■er % Aei Lys Thr Asp Val Thr Arg Leu Thr His Ser Arg Asp Thr Glu 

325 330 335 

Uu Leu Gin Ser Pro Val Aia Leu Ala Asp His Tyr Thr Phe Ser Arg 
340 345 350 

: tj Val His Pro Pro Gly Ser Pro Cys Pro Aia Lys Arg Leu lie Glu 
355 360 365 



■■2I1> 1113 
<212> DNA 

•■2 13> Homo sapiens 
•■100> 2 5 

atggcgaact atagccatgc agctgacaac attttgcaaa atctctcgcc tctaacagcc 60 

tttctgaaac tgacttcctt gggtttcata ataggagtca gcgtggtggg caacctcctg 120 

atctccattt tgctagtgaa agataagacc ttgcatagag caccttacta cttcctgttg 180 

jatctttgct qttcagatat cctcagatct gcaatttgtt tcccatttgt gttcaactct 240 

Ttcaaaaatg gctctacctg gacttatggg actctgactt gcaaagtgat tgcctttctg 300 

ggqgttttgt cctqtttcca cactgctttc atgctcttct gcatcagtgt caccagatac 360 

*: t agotatcg cccatcaccg cttctataca aagaggctga ccttttggac gtgtctggct 420 

'in gat ot gt a tggtgtggac tctgtctgtg gccatggcat ttcccccggt tttagacgtg 430 

igcacttact cattcattag ggaggaagat caatgcacct tccaacaccg ctccttcagg 540 

:otaatgatt ccttaggatt tatgctgctt cttgctctca tcctcctagc cacacagctt 600 

jtctaoctca agctgatatt tttcgtccac gatcgaagaa aaatgaagcc agtccagttt 660 

jtagcagcag tcagccagaa ctggactttt catggtcctg gagccagtgg ccaggcagct 720 

jooaattggc tageaggatt tggaaggggt cccacaccac ccaccttgct gggcatcagg 780 

:aaaatgcaa acaccacagg cagaagaagg ctattggtct tagacgagtt caaaatggag 840 

.aaagaatca gcagaatgtt ctatataatg acttttctgt ttctaacctt gtggggcccc 900 

'acctggtgg ootgttattg gagagttttt gcaagagggc ctgtagtacc agggggattt 960 
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• ;:;:•* ; ".r t .5 1 c t 5 g ^ t gagttttgcc caagcaggaa tcaacccttt tgtctgcatt 1C20 
r.t:t:daaca gggagctgag gcgctgtttc agcacaaccc ttctttactg cagaaaatcc 1080 
aggttaccaa ^ggaacctta ctgtgttata tga 1113 



■■■/:::;> ::6 

■ 2 1 1 "> 3^0 
■-212> PRT 

• ". 2 13 > Homo sapiens 



Met Ala Asn Tyr Ser His Ala Ala Asp Asn lie Leu Gin Asn Leu Ser 

5 10 15 

Pro Leu Thr Ala Phe Leu Lys Leu Thr Ser Leu Gly Phe He He Gly 

20 25 30 

7al Ser Val Val Gly Asn Leu Leu He Ser He Leu Leu Val Lys Asp 
3 5 4 0 4 5 

lys Thr Leu His Arg Ala Pro Tyr Tyr Phe Leu Leu Asp Leu Cys Cys 

5 0 5 5 60 

Ser Asp lie Leu Arg Ser Ala He Cys Phe Pro Phe Val Phe Asn Ser 

65 70 75 80 

Val Lys Asn Gly Ser Thr Trp Thr Tyr Gly Thr Leu Thr Cys Lys Val 
85 90 95 

Tie Ala Phe Leu Gly Val Leu Ser Cys Phe His Thr Ala Phe Met Leu 
10 0 10 5 110 

Phe Cys He Ser Val Thr Arg Tyr Leu Ala He Ala His His Arg Phe 
115 120 125 

Tyr Thr Lys Arg Leu Thr Phe Trp Thr Cys Leu Ala Val He Cys Met 
130 13 5 14 0 

Vil Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Leu Asp Val 
LIS 150 155 160 

Gly Thr Tyr Ser Phe He Arg Glu Glu Asp Gin Cys Thr Phe Gin His 
16 5 17 0 17 5 

Arg Ser Phe Arg Ala Asn Asp Ser Leu Gly Phe Met Leu Leu Leu Ala 
18 0 18 5 190 

L--u He Leu Leu Ala Thr Gin Leu Val Tyr Leu Lys Leu He Phe Phe 
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2 0 5 



His Asp Arg Arg Lys Met Lys Pro Val Gin Phe Val Ala Ala Val 

210 215 220 



Asn Trp Thr Phe E-is 



Gly Ala Ser Gly Gin Ala Ala 
235 240 



l.i Asn Trp leu Ala Gly Phe Gly Arg Gly Pre Thr Fro Pro Thr Leu 
245 250 255 

e-j Gly lie Arg Gin Asn Ala Asn Thr Thr Gly Arg Arg Arg Leu Leu 

260 265 270 

ai leu Asp Glu Phe Lys Met Glu Lys Arg lie Ser Arg Met Phe Tyr 

275 230 285 

le Met Thr Phe Leu Phe Leu Thr Leu Trp Gly Pro Tyr Leu Val Ala 

7 00 2 95 300 

ys Tyr Trp Arg Val Phe Ala Arg Gly Pro Val Val Pro Gly Gly Phe 

0 5 310 315 32 0 

Thr Ala Ala Val Trp Met Ser Phe Ala Gin Ala Gly lie Asn Pro 

325 330 335 

:\r Val Gys lie Phe Ser Asn Arg Glu Leu Arg Arg Cys Phe Ser Thr 
340 345 350 

hr Leu Leu Tyr Cys Arg Lys Ser Arg Leu Pro Arg Glu Pro Tyr Cys 

35 5 3 60 3 65 



l1 lie 
37Q 



210> 27 
211> 1030 
212 > ON A 

2 1 y> Homo sapiens 
400> 2^ 

tg-:aggtcc -igaacagcac cggcccggac 
-gatcgegg tggccctgcc cgtggtgtac 
acctcttct ctctgtgggt gctgtgccgg 
tcatgatca acctgagegt cacggacctg 
astaocatt gcaaccgcca ccactgggta 
-^gestttt aegcaaacat gtattccagc 



aacgcgacgc tgeagatget gcggaacccg 60 

tcgctggtgg eggeggtcag catcccgggc 120 

cgcatggggc ccagatcccc gteggtcate 130 

atgctggcca gcgtgttgcc tttccaaatc 240 

ttcggggtgc tgctttgcaa cgtggtgacc 300 

atcctcacca tgacctgtat cagcgtggag 360 
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■' 1 


■..11 


— 3 : 


ggglcclgta 


ccegctcagc 


tccaagcgct 


ggcgccgccg 


t cgttacgcg 


420 




j -J - 


cgogt 


gtgcagggac 


ctggctgctg 


ct cctgaccg 


ccct gt gece 


gctggcgcgc 


480 


a r 


eg. a 


tctca 


cct acccggt 


gcacgccct g 


ggca teat ca 


ectget tcga 


cgtcctcaag 


540 


1 


gac 


gatgc 


rccccagcgt 


ggccatgtgg 


gccgtgtt cc 


tct tcaccat 


ct tcatcctg 


600 








t cccgt t cgt 


gat caccgt g 


gctt gt taca 


cggccaccat 


cct caagct g 


660 




H ■ * 


a c g g 


aggaggcgca 


cggccgggag 


cageggagge 


gcgcggtggg 


cctggccgcg 


720 






ct.t jc 


1 g g c c 1 1 1 g t 


cacctgcttc 


gcccccaa ca 


actt cgtgct 


cct ggegcac 


780 


■a t 


cgt 


gagcc 


gcctgttcta 


cggcaagagc 


tact accacg 


tgtacaagct 


cacget gt gt 


840 


1- 


::ag 


e t q c c 


tcaacaactg 


t ct ggacccg 


tttgtttatt 


actt tgcgtc 


ccgggaat tc 


900 


. Li 


.g,V. 




t gcgggaata 


t tt gggctgc 


cgccgggtgc 


ccagagacac 


cctggacacg 


960 


;gccg 


cgaga 


gcctcttctc 


cgccaggacc 


aegt ccgtgc 


gctccgaggc 


cggtgcgcac 


1020 




1 g a a g g g a 


t ggagggagc 


caccaggccc 


ggcctccaga 


ggcaggagag 


t gtgt tctga 


1080 



:210> 23 

-.21 i> 359 

:212> FRT 

.'2 11> H.cmo sapiens 



■-10 0., 2 8 
'e r Gin 7a 1 



Pro Asn Ser Thr Gly Pro Asp Asn Ala Thr Leu Gin Met 
5 10 15 



Leu Arg Asn Pro Ala lie Ala 7al Ala Leu Pro Vai Val Tyr Ser Leu 
20 25 30 

7a 1 Ala Ala Val Ser ile Pro Gly Asn Leu Phe Ser Leu Trp Val Leu 
3 5 4 0 4 5 

Cys Arg Arg Met Gly Pro Arg Ser Pro Ser Val lie Phe Met lie Asn 

50 55 60 

Leu Ser Vai Thr Asp Leu Met Leu Ala Ser Val Leu Pro Phe Gin Ile 

65 70 75 80 

Tyr Tyr His Cys Asn Arg His His Trp Val Phe Gly Val Leu Leu Cys 

85 90 95 

Asn Vai Val Thr Val Ala Phe Tyr Ala Asn Met Tyr Ser Ser Ile Leu 
100 105 110 

Thr Vet Thr Cys Ile Ser Vai Glu Arg Phe Leu Gly Val Leu Tyr Pro 
115 120 125 

Leu Ser Ser Lys Arg Trp Arg Arg Arg Arg Tyr Ala Val Ala Ala Cys 
130 135 140 



/ \ i a j iv 



Trp Leu Leu Leu Leu Thr Ala Leu Cys Pro Leu Ala Arg 
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150 



155 160 



Thr Asp Leu Thr Tyr Fro Val His Ala Leu Gly He He Thr Cys Phe 
165 170 175 

Asp Val Leu Lys Trp Thr Met Leu Pro Ser Val Ala Met Trp Ala Val 
ISO 185 190 

Phe Leu Phe Thr He Phe lie Leu Leu Phe Leu He Pro Phe Val He 
195 200 205 

Thr Val Ala Cys Tyr Thr Ala Thr He Leu Lys Leu Leu Arg Thr Glu 

210 215 220 

Glu Ala His Gly Arg Glu Gin Arg Arg Arg Ala Val Gly Leu Ala Ala 
225 230 235 240 

Val Val Leu Leu Ala Phe Val Thr Cys Phe Ala Pro Asn Asn Phe Val 
245 250 255 

Leu Leu Ala His He Val Ser Arg Leu Phe Tyr Gly Lys Ser Tyr Tyr 

260 265 270 

His Val Tyr Lys Leu Thr Leu Cys Leu Ser Cys Leu Asn Asn Cys Leu 
275 280 285 

Asp Pro Phe Val Tyr Tyr Phe Ala Ser Arg Glu Phe Gin Leu Arg Leu 
290 295 300 

Arg Glu Tyr Leu Gly Cys Arg Arg Val Pro Arg Asp Thr Leu Asp Thr 

305 310 315 320 

Arg Arg Glu Ser Leu Phe Ser Ala Arg Thr Thr Ser Val Arg Ser Glu 
325 330 335 

Ala Gly Ala His Pro Glu Gly Met Glu Gly Ala Thr Arg Pro Gly Leu 
340 345 350 

Gin Arg Gin Glu Ser Val Phe 
3 55 



-V:10> 2 9 

< 2 1 1 > 150 3 
•'212> CNA 

< 2 I 3> Homo sapiens 
■■ 4 0 0 > 2 9 
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ggagogt c z r.z gggagga cagcccaggc ccggaggggg cagctgaggg ctcgcctgtg 60 

• • i :\ .• :c:."7 ccggggcgcg ctccggtgcc gcggcgagtg gcacaggctg gcagccatgg 120 
gctgagtgcc cgggacccaa ggggaggggg caactgctgg cgaccgccgg ccctttgcgt 180 

L^gcrggcccg ccccctcgcc tgccagctcc agccccgccc crcggagcggc gtccgctcac 240 

*:cggttcaag gcagcgcgac tgcgggtggc gcacgaccag ggcgcagacc ttggggcgcg 300 

• : ; : * :-i agtcggggct gctgcggccg gcgccggtga gcgaggtcat cgtcctgcat 360 
* ■. ' : ; -j ccggcaagct ccgcggtgcg agctaccagc cgggtgccgg cctgcgcgcc 4 20 
jacgccgtgg tgtgcctggc ggtgtgcgcc ttcatcgtgc tagagaatct agccgtgttg 480 

ggtgctcg gacgccaccc gcgcttccac gctcccatgt tcctgctcct gggcagcctc 54 0 

■acgttgtcgg atctgctggc aggcgccgcc tacgccgcca acatcctact gtcggggccg 600 

jtcacactga aactgtcccc cgcgctctgg ttcgcacggg agggaggcgt cttcgtggca 660 

■jtcact gcgt cegtgctgag cctcctggcc atcgcgctgg agcgcagcct caccatggcg 720 

ogcagggggc ccgcgcccgt ctccagtcgg gggcgcacgc tggcgatggc agccgcggcc "80 

-.ggggcgtgt cgctgcicct cgggctcctg ccagcgctgg gctggaattg cctgggtcgc 340 

■■'tggacgctt gctccactgt cttgccgctc tacgccaagg cctacgtgct cttctgcgtg 900 

ct cgcct t eg tgggcatcct ggccgcgatc tgtgcactct acgcgcgcat ctactgccag 960 

gtacgcgcca acgcgcggcg cctgccggca cggcccggga ctgeggggae cacctcgacc 1020 

cgggcgcgtc gcaagccgcg ctctctggcc ttgetgegea cgctcagcgt ggtgctcctg 1080 

jectttgtgg ::a t gt tgggg ccccctcttc ctgctgctgt tgctcgacgt ggcgtgcccg 1140 

gcgcgcacct gtcctgtact cctgcaggcc gatcccttcc tgggactggc catggccaac 1200 

:cacttctga accccatcat ctacacgctc accaaccgcg acctgcgcca cgcgctcctg 1260 

: .;c::t ggt ct getgeggacg ccactcctgc ggcagagacc cgagtggctc ccagcagtcg 1320 

gegagegegg ctgaggcttc egggggectg cgccgctgcc tgcccccggg ccttgatggg 13S0 

agct neageg geteggageg ctcatcgccc cagcgcgacg ggctggacac cagcggctcc 1440 

acaggcagcc ccggtgcacc cacagccgcc eggactet gg tatcagaacc ggctgeagae 1500 

caa 1503 



<210> 30 
< 2 1 1 > 500 
<212> PRT 

■■ 2 1 3 > Homo sapiens 
<4CC> 30 

Met Glu Arg Pro Trp Glu Asp Ser Pro Gly Pro Glu Gly Ala Ala Glu 
15 10 15 

Gly Ser Pro Val Pro Val Ala Ala Gly Ala Arg Ser Gly Ala Ala Ala 

20 25 30 

Ser Gly Thr Gly Trp Gin Pro Trp Ala Glu Cys Pro Gly Pro Lys Gly 
35 40 45 

Arg Gly Gin Leu Leu Ala Thr Ala Gly Pro Leu Arg Arg Trp Pro Ala 

5 0 5 5 60 

Pro Ser Pro Ala Ser Ser Ser Pro Ala Pro Gly Ala Ala Ser Ala His 

6 5 7 0 7 5 8 0 
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2- : r Val Gin Gly Ser Ala Thr Ala Gly Gly Ala Arg Pro Gly Arg Arg 
35 90 95 

p ro Trp Gly Ala Arg Pro Met Glu Ser Gly Leu Leu Arg Pro Ala Pro 

100 105 110 

7a 1 Ser Glu Val He Val Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg 
115 120 125 

Gly Ala Ser Tyr Gin Pro Gly Ala Gly Leu Arg Ala Asp Ala Val Val 
130 135 140 

Gys Leu Ala Val Gys Ala Phe He Val Leu Glu Asn Leu Ala Val Leu 
1-3 5 150 155 160 

Leu Val Leu Gly Arg His Pro Arg Phe Kis Ala Pro Met Phe Leu Leu 
165 170 175 

Leu GLv Ser Leu Thr Leu Ser Asp Leu Leu Ala Gly Ala Ala Tyr Ala 
180 185 190 

Ala Asn He Leu Leu Ser Gly Pro Leu Thr Leu Lys Leu Ser Pro Ala 

195 200 205 

Leu Trp Phe Ala Arg Glu Gly Gly Val Phe Val Ala Leu Thr Ala Ser 

210 215 2 2 0 

Val Leu Ser Leu Leu Ala He Ala Leu Glu Arg Ser Leu Thr Met Ala 

225 230 235 240 

Arg Arg Gly Pro Ala Pro Val Ser Ser Arg Gly Arg Thr Leu Ala Met 
245 250 255 

Ala Ala Ala Ala Trp Gly Val Ser Leu Leu Leu Gly Leu Leu Pro Ala 

260 265 270 

Leu Gly Trp Asn Cys Leu Gly Arg Leu Asp Ala Cys Ser Thr Val Leu 

275 230 285 

Fro Leu Tyr Ala Lys Ala Tyr Val Leu Phe Cys Val Leu Ala Phe Val 

290 295 300 

Gly lie Leu Ala Ala He Cys Ala Leu Tyr Ala Arg He Tyr Cys Gin 

.3 0 5 310 315 320 

Val Arg Ala Asn Ala Arg Arg Leu Pro Ala Arg Pro Gly Thr Ala Gly 
325 330 335 
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Thr Thr Ser . hr Arg Aia Arg Arg Lys Pro Arg Ser Leu Ala Leu Leu 
340 345 350 

Arg Thr Leu Ser Val Val Leu Leu Ala Fhe Val Ala Cys Trp Gly Pro 

355 360 365 

Leu Phe Leu Leu Leu Leu Leu Asp Val Ala Cys Pro Ala Arg Thr Cys 

3-3 3 7 5 3 8 0 

Pro Val Leu Leu Gin Ala Asp Pro Phe Leu Gly Leu Ala Met Ala Asn 
3*5 390 395 400 

Ser Leu Leu Asn Pro lie lie Tyr Thr Leu Thr Asn Arg Asp Leu Arg 
405 410 415 

His Ala Leu Leu Arg Leu Val Cys Cys Gly Arg His Ser Cys Gly Arg 
42 0 425 430 

Aso Pro Ser Gly Ser Gin Gin Ser Ala Ser Ala Ala Glu Ala Ser Gly 
435 440 445 

Gly Leu Arg Arg Cys Leu Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly 
4 50 4 55 4 60 

Ser Glu Arg Ser Ser Pro Gin Arg Asp Gly Leu Asp Thr Ser Gly Ser 
46 5 4 70 4 75 4 80 

Thr Gly Ser Pro Gly Aia Pro Thr Ala Ala Arg Thr Leu Val Ser Glu 
485 490 495 

Pro Ala Ala Asp 
500 



< 2 1 0 > 31 

< 2 1 1 > 1029 
•■■2 12> DNA 

•■■ 2 1 3 > Home sapiens 
4^n> 31 

■itqcaagccg tcgacaatct cacctctgcg cctgggaaca ccagtctgtg caccagagac 60 
tdcaaaatca cccaggtcct cttcccactg ctctacactg tcctgttttt tgttggactt 120 
atoacaaatg gectggegat gaggattttc tttcaaatcc ggagtaaatc aaactttatt 180 
af:t:ttctta agaacacagt catttctgat cttctcatga ttctgacttt tccattcaaa 240 
attcttagtg atgecaaact gggaacagga ccactgagaa cttttgtgtg tcaagttacc 300 
*_ccgtcatat tttatttcac aatgtatatc agtatttcat tcctgggact gataactatc 360 
-ja *: eg a cc agaagaccac caggecattt aaaacatcca accccaaaaa tctcttgggg 420 
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; - *. i a q a 


- r -' : 


z *- r. ;~ 


catctgggca 


ttcatgttct 


tactctcttt 


gectaacatg 


430 


■■' ' ' ; > 




acaggcagcc 


gagagacaag 


aatgtgaaga 


aatgetet tt 


ccttaaat ca 


540 


.jag t teg 


qtC 


taqtctggca 


tgaaatagt a 


aattacatct 


gt caagt cat 


tttctggatt 


600 




r aa 


ttgttat tgt 


atgt tataca 


ctcattacaa 


aagaactgt a 


ccggtcatac 


660 


j t. a a g a a 


cga 


ggggtgtagg 


taaagtcccc 


aggaaaaagg 


tgaacgt caa 


agttttcatt 


720 




z 3 


tattctttat 


z tgttttgtt 


cct ttccatt 


tt geccgaat 


tcct tacacc 


780 




aaa 


cccgggatgt 


ct t tgac tgc 


act get gaaa 


atactct gt t 


ct at gtgaaa 


840 


i .a g a q c a 


etc 


tgtggttaac 


t t ccttaaat 


gcat gectgg 


at ccgtt cat 


ctattttttc 


900 




ccttcagaaa 


ttccttgata 


agt atget ga 


agtgccccaa 


tt ctgeaaca 


960 


- •. •* r; 




aggacaat ag 


gaaaaaagaa 


cagga t ggtg 


gtgacccaaa 


tgaagagact 


1020 



I0> 32 

n> 34 2 

12> : : ?.T 

I3> liomo sapiens 



y.^z 3 in Ala Vai Asp Asn Leu Thr Ser Ala Pro Gly Asn Thr Ser Leu 

L 5 10 15 

Vys Thr Arg Asp Tyr Lys lie Thr Gin Val Leu ?he Pro Leu Leu Tyr 

20 25 30 

Thr Vai Leu Phe Phe Val Gly Leu lie Thr Asn Gly Leu Ala Met Arg 

3 5 4 0 4 5 

lie Phe Phe Gin lie Arg Ser Lys Ser Asn Phe lie lie Phe Leu Lys 

50 55 60 

Asn Thr 'Val lie Ser Asp Leu Leu Met lie Leu Thr Phe Pro Phe Lys 

65 7 0 7 5 8 0 

He Leu Ser Asp Ala Lys Leu Gly Thr Gly Pro Leu Arg Thr Phe Val 
85 90 95 

Vys Gin Val Thr Ser Val lie Phe Tyr Phe Thr Met Tyr lie Ser lie 
10 0 10 5 110 

Ger Phe Leu Gly Leu lie Thr He Asp Arg Tyr Gin Lys Thr Thr Arg 
115 120 125 

Pro Phe Lys Thr Ser Asn Pro Lys Asn Leu Leu Gly Ala Lys He Leu 



i j i. 



135 140 



Vai Val lie Trp Ala Phe Met Phe Leu Leu Ser Leu Pro Asn Met 
150 155 160 
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He Leu Thr Asn Arg Gin Pro Arg Asp lys Asn Val Lys Lys Cys Ser 
16 5 17 0 17 5 

1 leu Lys Ser Glu Phe Gly Leu Val Trp His Glu He Val Asn Tyr 

18 0 18 5 190 

He Gys Gin Val lie Phe Trp He Asn Fhe Leu He Val He Val Cys 

195 200 205 

Tyr Ohr Leu He Thr Lys Glu Leu Tyr Arg Ser Tyr Val Arg Thr Arg 

HO 215 220 

Gly Val Gly Lys Val Pro Arg Lys Lys Val Asn Val Lys Val Phe He 
225 230 235 240 

He He Ala Val Phe Phe lie Cys Phe Val Pro Phe His Phe Ala Arg 
245 250 255 

in v vr:- Tyr Thr Leu Ser Gin Thr Arg Asp Val Phe Asp Cys Thr Ala 

260 265 270 

Glu Asn Thr Leu Phe Tyr Val Lys Glu Ser Thr Leu Trp Leu Thr Ser 
275 280 285 

Leu Asn Ala Cys Leu Asp Pro Phe lie Tyr Phe Phe Leu Cys Lys Ser 

290 295 300 

Phe Arg Asn Ser Leu He Ser Met Leu Lys Cys Pro Asn Ser Ala Thr 

305 310 315 320 

Ser Leu Ser Gin Asp Asn Arg Lys Lys Glu Gin Asp Gly Gly Asp Pro 
325 330 335 

Asn Glu Glu Thr Pro Met 

340 



•H 10;- 3 3 




■ ■' 2 1 3 > H o m o sapiens 
H00> 33 

a^gtcggtct gctaccgtcc cccagggaac 
gccacaggca cagccttcct gctgctggcg 
gtggtgtgga gcttggcggg ctggcggcct 
]H-l^cc tggcgctggc ogacggcgcg 



gagacactgc tgagctggaa gacttcgcgg 60 

gcgctgctgg ggctgcctgg caacggcttc 120 

gcacgggggc gaccgctggc ggccacgctt 180 

gtgctgctgc tcacgccgct ctttgtggcc 240 



33 






; -2 j ' ' 


g = ; 7" " 


gccgctgggc 


eaggeggget 


geaaggeggt 


gtactacgt g 


3 00 


' '?c-7c 


gctca 


geatgtaege 


cagcgtgctg 


ct caccggcc 


tget eagect 


geageget gc 


360 


-cqc 


agtca 


cccgcccctt 


cctggcgcct 


cggctgcgca 


gcccggccct 


ggcccgccgc 


420 


■Z get 


gctgg 


eggtctgget 


ggccgccct g 


ttgetcgccg 


tcccggccgc 


cgtctaccgc 


480 


: a c c t 


gtgga 


gggaccgcgt 


at gecaget g 


tgecacccgt 


cgccggt cca 


cgccgccgcc 


540 


" -"i .. c t 


^agee 


t ggagact ct 


gaeegcttte 


gtgett eett 


t egg get gat 


get eggctgc 


600 


' ^G 


ogtga 


c g c t g g c a c g 


getgegggge 


gcccgctggg 


get ccgggcg 


geaeggggeg 


660 


cgqgt 


gggee 


ggctggtgag 


cgccatcgt g 


ct tgectteg 


gettgetctg 


ggccccct ac 


720 


• ^ ' • •• 


a g t - a 


acctt ct gca 


ggeggtcgea 


gcgctggct c 


caceggaagg 


ggccttggcg 


780 


aagct 


gggcg 


g a g c c g g c c a 


gg egg eg eg a 


gegggaact a 


eggect tggc 


cttcttcagt 


840 




egtea 


a cc eg gt get 


etaegt cttc 


acegctggag 


at ct getgee 


ccgggcaggt 


900 


ccccgtttcc 


tcacgcggct 


ct tcgaaggc 


t ct ggggagg 


cccgaggggg 


cggccgct ct 


960 


,-igggaaggga 


oca tggagct 


ccgaact ace 


cctcagctga 


aagtggtggg 


gcagggccgc 


1020 


jgcaa 


tggag 


aeccgggggg 


tgggatggag 


aaggacggt c 


cggaatggga 


cctttga 


1077 



•:'210> 3 4 

■■■■ 2 11> 3 58 

•'■'12 > FPVT 

• 2 1 3 > Homo sapiens 

•-.'4 O0> 3 4 

Met Val Cys Tyr Arg Pro Pro Gly Asn Glu Thr Leu Leu Ser Trp 

5 10 15 

Lys Tr.r Ser Arg Ala Thr Gly Thr Ala Phe Leu Leu Leu Ala Ala Leu 

2 0 2 5 30 

Lc-u Gly Leu Pro Gly Asn Gly Phe Val Val Trp Ser Leu Ala Gly Trp 
35 40 45 

Arg Pro Ala Arg Gly Arg Pro Leu Ala Ala Thr Leu Val Leu His Leu 

50 55 60 

Ala Leu Ala Asp Gly Ala Val Leu Leu Leu Thr Pro Leu Phe Val Ala 

65 70 75 80 

Phe Leu Thr Arg Gin Ala Trp Pro Leu Gly Gin Ala Gly Cys Lys Ala 

3 5 90 95 

Val Tyr Tyr Val Cys Ala Leu Ser Met Tyr Ala Ser Val Leu Leu Thr 

100 105 110 

Gly Leu Leu Ser Leu Gin Arg Cys Leu Ala Val Thr Arg Pro Phe Leu 

115 120 12 5 

Ala Pro Arg Leu Arg Ser Pro Ala Leu Ala Arg Arg Leu Leu Leu Ala 
130 135 140 

3 9 



7a 1 Trp Leu Ala Ala Leu Leu Leu Ala Val Pro Ala Ala Val Tyr Arg 
14 5 150 155 160 

His Leu Trp Arg Asp Arg Val Cys Gin Leu Cys His Pro Ser Pro Val 

165 170 175 

His Ala Ala Ala His Leu Ser Leu Glu Thr Leu Thr Ala Phe Val Leu 
16 0 18 5 190 

Fro Phe Giy Leu Met Leu Gly Gys Tyr Ser Val Thr Leu Ala Arg Leu 
195 200 205 

Arg Giy Ala Arg Trp Gly Ser Gly Arg His Gly Ala Arg Val Gly Arg 
210 215 2 2 0 

Leu Val Ser Ala lie Val Leu Ala Phe Gly Leu Leu Trp Ala Pro Tyr 

225 230 235 240 

v.r y His Ala Val Asn Leu Leu Gin Ala Val Ala Ala Leu Ala Pro Pro Glu 

' ' \ 245 250 255 

r 



Gly Ala Leu Ala Lys Leu Gly Gly Ala Gly Gin Ala Ala Arg Ala Gly 

2 60 2 65 27 0 

Thr Thr Ala Leu Ala Phe Phe Ser Ser Ser Val Asn Pro Val Leu Tyr 

2^5 230 285 

Val Phe Thr Ala Giy Asp Leu Leu Pro Arg Ala Gly Pro Arg Phe Leu 

290 295 300 

Thr Arg Leu Phe Glu Gly Ser Gly Glu Ala Arg Gly Gly Gly Arg Ser 
3C5 310 315 320 

Arg Glu Gly Thr Met Glu Leu Arg Thr Thr Pro Gin Leu Lys Val Val 

325 330 335 

Giy Gin Gly Arg Gly Asn Gly Asp Pro Gly Gly Gly Met Glu Lys Asp 
340 345 350 

Giy Pro Glu Trp Asp Leu 

35 5 



■.2 10> 3 5 
<2 11> 100 5 
<2 12> DMA 

<2 13'> Homo sapiens 



40 



■itgctgggga 


t ca tggcat g 


gaatgeaact 


tgcaaaaact 


ggctggcagc 


agaggctgee 


60 


; z g q a a a a q t 


actacctttc 


cattttttat 


gggat t gagt 


tegtt gt ggg 


agtcct tgga 


120 


aataccattg 


t tgtttacgg 


ct acatcttc 


tct ctgaaga 


act ggaacag 


cagt aatatt 


130 


• atctcttta 


acct ctctgt 


ct ctgact ta 


get z t tctgt 


gcaccct ccc 


catgetgata 


240 


i ^ 7 a g t 1 a t g 


ccaat ggaaa 


ct ggata tat 


ggagacgtgc 


tetgeat aag 


caaccgat at 


300 


gtqcttcatg 


ccaacct eta 


t accagca 1 1 


ct ctttctca 


ctttt atcag 


cat agat cga 


360 


t a rtt^ataa 


"taagtatcc 


tttccgagaa 


caccttctgc 


aaaagaaaga 


gtttgetatt 


420 




t ggccatttg 


ggttt tagta 


acct tagagt 


tactacccat 


act tcccctt 


4 80 


i t a a a t c c t g 


1 1 ataactga 


caatggcacc 


acct gtaat g 


attt tgcaag 


tt ctggagac 


540 


.:c,:aactaca 


acctcattta 


cagcatgtgt 


ct aacactgt 


tggggttcct 


tattcctctt 


600 


tttgtgatgt 


gt ttctttta 


ttacaagatt 


gctctct tec 


taaagcagag 


gaat aggcag 


660 


gttgct actg 


cc:tgcccct 


tgaaaagect 


ct caact tgg 


teat catggc 


agtggt aatc 


720 


ttctctgtgc 


t tt ttacacc 


ct at cacgt c 


at gcggaat g 


tgaggatege 


tt cacgcctg 


780 


gggagt tgga 


agcagt atca 


gt gcact cag 


gt egtcatea 


actcctttta 


cattgt gaca 


840 


oggect ttgg 


cctttctgaa 


cagtgt catc 


aaccctgtct 


tctattttct 


tt tgggagat 


900 


.-acttcaqgg 


acatgetgat 


gaatcaactg 


agacacaact 


tcaaatccct 


tacatcctt t 


960 


sgcaqatggg 


ctcatgaact 


cctactt tea 


ttcagagaaa 


agtga 




1005 



■ 210> 3 6 
■- 2 1 1 > 334 
•;:]2> PRT 

•■: 2 1 3> Homo sapiens 
<400> 36 

Ket Leu Gly He Met Ala Trp Asn Ala Thr Cys Lys Asn Trp Leu Ala 
15 10 15 

Ala Glu Ala Ala Leu Glu Lys Tyr Tyr Leu Ser lie Phe Tyr Gly He 

20 25 30 

Glu Phe Val Val Gly Val Leu Gly Asn Thr He Val Val Tyr Gly Tyr 
35 4 0 4 5 

lie Phe Ser Leu Lys Asn Trp Asn Ser Ser Asn He Tyr Leu Phe Asn 

50 55 60 

Leu Ser Val Ser Asp Leu Ala Phe Leu Cys Thr Leu Pro Met Leu He 

6- 70 75 80 

Arq Ser Tyr Ala Asn Gly Asn Trp He Tyr Gly Asp Val Leu Cys He 
85 90 95 

Ser Asn Arg Tyr Val Leu His Ala Asn Leu Tyr Thr Ser He Leu Phe 
100 105 110 



4 1 



Leu Thr Phe lie Ser lie Asp Arg Tyr Leu lie lie Lys Tyr Pro Fhe 
115 120 125 

Arg Glu His Leu Leu Gin Lys Lys Glu Phe Ala lie Leu lie Ser Leu 
130 135 140 

Ala lie Trp vai Leu Vai Thr Leu Glu Leu Leu Pro lie Leu Pro Leu 
14 5 150 155 160 

lie Asr. Pro Vai lie Thr Asp Asn Gly Thr Thr Cys Asn Asp Phe Ala 
165 170 175 

ler Ser Gly Asp Pro Asn Tyr Asn Leu He Tyr Ser Met Cys Leu Thr 
ISO 185 190 

Leu Leu Gly Phe Leu lie Pro Leu Phe Vai Met Cys Phe Phe Tyr Tyr 

195 200 205 

Lys lie Aid Leu Phe Leu Lys Gin Arg Asn Arg Gin Vai Ala Thr Ala 

210 215 220 

Leu Pro Leu Glu Lys Pro Leu Asn Leu Vai lie Met Ala Vai Vai lie 
225 230 235 240 

Phe Ser Vai Leu Phe Thr Pro Tyr His Vai Met Arg Asn Vai Arg lie 
245 250 255 

Ala Ser Arg Leu Gly Ser Trp Lys Gin Tyr Gin Cys Thr Gin Vai Vai 
260 265 270 

lie Asn Ser Phe Tyr lie Vai Thr Arg Pro Leu Ala Phe Leu Asn Ser 

275 230 285 

Vai lie Asn Pro Vai Phe Tyr Phe Leu Leu Gly Asp His Phe Arg Asp 
290 295 300 

Met Leu Met Asn Gin Leu Arg His Asn Phe Lys Ser Leu Thr Ser Phe 

305 310 315 320 

Ser Arg Trp Ala His Glu Leu Leu Leu Ser Phe Arg Glu Lys 

325 330 



<210> 37 

2 1 1 > 12 9 6 

< 2 1 2 > DNA 

<213> Homo sapiens 



42 



■■i t 


gcaggegc 


::aaca:tac 


cccggagcag 


ttctctcggc 


tgctgcggga 


ccacaacctg 


60 


i C 


gcgggagc 


agtt catcgc 


t ct gt accgg 


ctgcgaccgc 


tegt ctacac 


cccagagctg 


120 


1 ' 71 


gggacgcg 


ccaagctggc 


cct cgtgct c 


accggcgtgc 


teat ct tege 


cctggcgctc 


130 






ctctggtgt t 


ctacgtggtg 


acccgcagca 


aggecatgeg 


caccgt cacc 


240 




• : • - - ~ i 


tctgct cctt 


ggcgctcagt 


gacct gctca 


tcacct tctt 


ctgeat tccc 


300 




?. a : t. g c 


tccagaacat 


ttccgacaac 


tggctggggg 


gt gctt teat 


t tgcaagatg 


360 




gocatttg 


tccagtctac 


cgctgt tgtg 


acagaaatgc 


t cact at gac 


ctgeat tget 


420 


} - 


ggaaaggc 


aceagggact 


t gt gcat cct 


1 1 1 aaaat ga 


agtggcaat a 


caccaaccga 


430 


agggctttea 


caatgctagg 


t gt ggtc tgg 


ctggt ggcag 


t categtagg 


atcacccatg 


540 




gcacgtgo 


a a c a a c 1 1 g a 


gat caaata t 


gactt cctat 


at gaaaagga 


acacat ct gc 


600 


' J 


ctt agaag 


agt ggaccag 


ccctgtgcac 


cagaaga tct 


acaccacct t 


catcct tgtc 


660 


■ ; t 


< : c t c 1 1 c c 


tcctgcct ct 


t at ggtgat g 


cttattctgt 


acagtaaaat 


tggt ta tgaa 


720 


■ - +- 


ttggataa 


agaaaagagt 


tggggatggt 


t cagt gctt c 


gaactatt ca 


tggaaaagaa 


780 


2 t 


qtccaaaa 


t agccaggaa 


gaagaaacga 


get gt catt a 


tgat ggtgac 


agtggt ggct 


840 


t 


ctt t gctg 


tgtgctgggc 


accat tccat 


gtt gt ccata 


tgat gatt ga 


atacagtaat 


900 


f r 


t qaaaaqg 


aatatgatga 


tgt cacaat c 


aagatgattt 


ttgctatcgt 


gcaaat tatt 


960 


g gat tt tcca 


act ccatctg 


t aa tcccat t 


gtctatgea t 


tt at gaatga 


aaactt caaa 


1020 


a a 


aaatgttt 


t gt ctgcagt 


t tgttat tgc 


atagtaaat a 


aaacctt ct c 


tccagcacaa 


1080 


7 


gcatggaa 


a 1 1 caggaa t 


tacaat gatg 


eggaagaaag 


caaagtt t tc 


cctcagagag 


1140 


a a 


t cc3 g tgg 


aggaaaccaa 


aggagaagca 


1 1 cagtgatg 


gcaacatt ga 


agtcaaat tg 


1200 


\ g t g a a c a g a 


cagaggagaa 


gaaaaagct c 


aaacgacat c 


tt gctctctt 


taggtc tgaa 


1260 




ggctgaga 


at tctccttt 


agacagtggg 


cat taa 






1296 



2 1 0 > 3 8 
<■; 2 : 1 > 4 31 
<212> PRT 

<213> He mo sapiens 

< 4 0 0 > 3 3 

Met Gin Ala Leu Asn lie Thr Pro Glu Gin Phe Ser Arg Leu Leu Arg 
1 5 10 15 

Asp His Asn Leu Thr Arg Glu Gin Phe lie Ala Leu Tyr Arg Leu Arg 

20 25 30 

Pre Leu Val Tyr Thr Pro Glu Leu Pro Giy Arg Ala Lys Leu Ala Leu 
35 40 45 

Val Leu Thr Gly Val Leu lie Phe Ala Leu Ala Leu Phe Gly Asn Ala 

50 55 60 

Leu Val Phe Tyr Val Val Thr Arg Ser Lys Ala Met Arg Thr Val Thr 

6 5 70 75 80 

Asn lie Phe lie Cys Ser Leu Ala Leu Ser Asp Leu Leu lie Thr Phe 

85 90 95 



43 



Phe Cys lie Pro Val .hr Met Leu Gin Asn He Ser Asp Asn Trp Leu 

10 0 10 5 110 

Gly Gly Ala Phe lie Cys Lys Met Val Pro Phe Val Gin Ser Thr Ala 
115 120 125 

Val Val Thr Glu Met Leu Thr Met Thr Cys He Ala Val Glu Arg His 
13C 135 140 

GLr. Giy Leu Val His Pro Phe Lys Met Lys Trp Gin Tyr Thr Asn Arg 
145 150 155 160 

Arg Ala Phe Thr Met Leu Gly Val Val Trp Leu Val Ala Val He Val 
16 5 17 0 17 5 

Giy Ser Pro Met Trp His Val Gin Gin Leu Glu He Lys Tyr Asp Phe 
130 13 5 190 

Lou Tyr- Oiu Lys Glu His He Cys Cys Leu Glu Glu Trp Thr Ser Pro 

195 200 205 

Val His Gin Lys He Tyr Thr Thr Phe He Leu Val lie Leu Phe Leu 

21 0 215 22 0 

Leu Pro Leu Met Vai Met Leu lie Leu Tyr Ser Lys lie Gly Tyr Glu 

22 5 2 30 23 5 24 0 

Leu Trp He Lys Lys Arg Val Gly Asp Gly Ser Val Leu Arg Thr He 
245 250 255 

His Giy Lys Glu Met Ser Lys lie Ala Arg Lys Lys Lys Arg Ala Val 
260 265 270 

He Met Met Val Thr Val Val Ala Leu Phe Ala Val Cys Trp Ala Pro 

275 280 285 

Phe His Val Val His Met Met He Glu Tyr Ser Asn Phe Glu Lys Glu 

230 295 3C0 

Tyr Asp Asp Val Thr lie Lys Met lie Phe Ala He Val Gin lie lie 
H5 310 315 320 

Giy Phe Ser Asn Ser He Cys Asn Pro lie Val Tyr Ala Phe Met Asn 
325 330 335 

Glu Asn Phe Lys Lys Asn Val Leu Ser Ala Val Cys Tyr Cys He Val 
340 345 350 



44 




Asn Lys Thr Phe Ser Pro Ala Gin Arg His Gly Asn Ser Gly lie Thr 

3 55 3 60 3 65 

Met Met Arg Lys Lys Ala Lys Phe Ser Leu Arg Glu Asn Pro Val Glu 

3^0 375 380 

Gin Thr Lys Gly Glu Ala Phe Ser Asp Gly Asn lie Glu Val Lys Leu 

^35 39C 395 400 

Gys Glu Gin Thr Glu Glu Lys Lys Lys Leu Lys Arg His Leu Ala Leu 
4 0 5 4 10 415 

Phe Arg Ser Glu Leu Ala Glu Asn Ser Pro Leu Asp Ser Gly His 
420 425 430 



:210> 3 9 
■ 2 1 1 > 24 
.2 12 > DNA 

: 2 1 3 > H omo sapiens 
4 00> 39 

^gtg^acag cagttcgcag agtg 24 



< 2 1 1 > 2 4 

<212> DNA 

<213> Hcrr.o sapiens 

■-400> 40 

gagtgccagg cagagcaggt agac 24 



:210> 41 

: 2 1 1> 31 

212> DNA 

"213> iic::o sapiens 



: 4 0 0 > 4 1 

:ccgaat_cc tgcttgctcc cagcttggcc c 31 



•'210> 4 2 

<2U> 3 2 

•^212> DNA 

■■'213 > Homo 



sapiens 



45 



4 . > ~t L 

tgtggatcct gctgtcaaag gtcccattcc gg 



32 



2 c 

■ 712 .> DNA 

. 21 3> Ho.nc sapiens 



4 00> 4 3 

,;a:aatgct aggtgtggt: 



20 



r / 



<2i:>> 4 4 

- i : ; • 22 

<212> DNA 

<213> Homo sapiens 
■■.4 20> 4 4 

^ji-j^aqaca atgggattac ag 22 



■ ■ 2 I 0 > 4 5 

'■'2 11 > 511 

<212> DNA 

2 1 3 > Homo sapiens 

<400> 45 

tcacaatqct aggtgcggtc tggctggtgg cagtcatcgt aggatcaccc atgtggcacg 60 

tgcaacaact tgagatcaaa tatgacttcc tatatgaaaa ggaacacatc tgctgcttag 120 

aagagtggac cagccctgtg caccagaaga tctacaccac cttcatcctt gtcatcctct 180 

ticjtcctgcc tcttatggtg atgcttattc tgtacgtaaa attggttatg aactttggat 240 

aaagaaaaga gttggggatg gttcagtgct tcgaactatt catggaaaag aaatgtccaa 300 

aatagccagg aagaagaaac gagctgtcat tatgatggtg acagtggtgg ctctctttgc 360 

tigtgtgctgg gcaccattcc atgttgtcca tatgatgatt gaatacagta attttgaaaa 420 

ggaatacgat gatgtcacaa tcaagatgat ttttgctatc gtgcaaatta ttggattttc 480 

oaact ocatc tgtaatccca ttgtctatgc a 511 



■21C> 4 6 

.■;n;> 21 

:212> DNA 

'213 > Homo sapiens 
:4 00> 4 6 

;tigcttagaa gaglggacca g 



46 



< 210 > 4 7 

<2il> 22 

■ 212> CNA 

<213> Homo sapiens 



-rtgtgcacca gaagatctac ac 



22 



.-2I0> 4 8 
:211> 21 
:212> DNA 

■ 2 1 "3 > Homo sapiens 
;400> 4 3 

;aaggatgaa ggtggtgtag a 21 



2 10> 4 9 
:21L> 2 3 



2 1 3 > Homo sapiens 



4 00> 4 9 
g^gt agate 



:ctggtgcac agg 



23 



<21G> 50 

-211 > 21 

< 2 1 2 > DNA 

2 1 3 > Homo sapiens 

< 4 30> 50 

geaatgeagg tcatagtgag c 21 



2 10> 51 

■211> 27 

■■212 > DNA 

2 1 3 > Homo sapiens 



■■■4 7Q-> 51 

r.jgagcatgg tgacgggaat gcagaag 27 



" 2 1 0 > 52 



■:.-4 00> 52 

jngatgagca ggtcactgag cgccaag 27 



-■2:0.- 53 

•■■:2ll> 2 3 

-.2I2:> DNA 

<2:5> Home sapiens 

•'4 00> 5 3 

goaatqcagg cgcttaacat tac 23 



^213-> 5 4 
;211> 2 2 
- 2 1 2 > DNA 

■ 213 > Home sapiens 
MC0> 5 4 

-.tgggrtaca atctgaaggg ca 22 



:210.> 5 5 
.21 1> 2 3 
:212> DNA 

-'213 > Homo sapiens 

' 4 0 C > 5 5 

}c r eogtgtc cagcaggact ctg 23 



:210> 56 
:2ll> 24 
:212> DNA 

■ 213 > Homo sapiens 

. 4 0 T> 5 6 

ig-ogtgttcc tggaccctca cgtg 24 



: 2 1 1 > 2 9 
' 2 1 2 > DNA 

■. 2 1 3 > Homo sapiens 
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raygccttgg attttaatgt cagggatgg 



DN'A 

'21 3> Homo sapiens 
4C0> 58 

gagagtcag ctctgaaaga attcagg 



210> 5 9 

2 1 1 > 21 

212> DNA 

21 3> Homo 



saoiens 



4 00> 5 9 

gatgtgatg ocagatacta atagcac 



;-.ic> 63 

2 1 1 > 2 / 
212> DNA 

213> Homo sapiens 
400> 60 

ctgattcat ttaggtgaga ttgagac 



210 > 61 
2 1 1 > 21 
2 12> DNA 

2 1 3 > Homo sapiens 

4 0Q> 61 

acaggtacc ttgccatcaa g 



21G> 62 
21 1> 22 

2 1 2 > DNA 

2 1 3 > Homo sapiens 
4 00> 62 

vgcacaatg ccagtgataa gg 



49 



:2I0> 63 
. 2 1 1 > 2 7 
: 2 1 2> DNA 

: 2 1 3 > Homo sapiens 

■ 0 0 > 6 3 

; r gacr_ t ct t gttcctggca gcagcgg 27 



■'2 10> 6 4 
<2U> 2 7 
<212> DNA 

<213> Homo sapiens 
<400> 64 

agaccagcca gggcacgctg aagagtg 27 



• ■■ 2 1 0 > 6 5 

2 1 1 > 32 

•■.-212> DNA 

2 I 3 '"> Heme sapiens 

<4 0C> 6 5 

qatcaagctt ccatcctact gaaaccatgg tc 32 



<210> 6 6 
<2I1> 3 5 
<2\2> DNA 

- ■ 2 1 3 > Home sapiens 
<4 00> 66 

gatcagatct cagttccaat attcacacca ccgtc 35 



■ 2 1 C> 67 

■ 2 1 1 > 22 
2\2> DNA 

■ 2 1 3 > Homo sapiens 
■4G2> 67 

:tggtgtgct ccatggcatc cc 22 



210> 6 8 
2I1> 22 
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■ ■ 2 1 3 > Homo sapiens 
■■. 4C0> 63 

.jtaagcctcc caqaacqaqa gg 




<21C> 6 9 
<■ 2 1 I > 2 4 
-212> DNA 

<213> Homo sapiens 
•:4 00> 69 

oagcgcaggg tgaagcctga gage 



i. 



'J 



s.21Q > ^0 
< 2 1 1 > 2 4 
<212> DNA 



>. • . 2: 1 i > Homo sapiens 



f{\ jgcacctgct gtgaectgtg cagg 



24 



<210> 71 
'■211> 22 
2 : 2 > DNA 

2 1 3 > Homo sapiens 

4 0 0 > 7 1 

gtcctgocac -tcgagacat gg 22 



■21C> 72 
'21 1> 2 3 
: 2 1 2 > DNA 

o:13> Homo sapiens 
400 ^2 

23 



jaaacttctc tgcccttacc gtc 



■■■21Q> 7 3 
-.■211> 2 6 
<2I2> DNA 

*- 2 1 3 > Homo sapiens 
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- -■ 4 C 0 > "'3 

■,:caacaccag catccatggc atcaag 



< 2 I 0> 74 

■-;:;:> 2 7 

212> DNA 

2 1 3 > Homo sapiens 
<4G0> 74 

^gagagtcag ctctgaaaga attcagg 



52 
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SEQUENCE LISTING 

/I 

(1) GENERAL INFORMATION: 

(i)\ APPLICANT : Chen, Ruoping 
\ Dang, Huong T. 

5 \ Li aw, Chen W. 

Lin, I -Lin 

(ii) TITLES OF INVENTION: Human Orphan G Protein- Coupled Receptors 
(iii) NUMBE^ OF SEQUENCES: 74 

10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Arena Pharmaceuticals, Inc. 

(B) STREET: 6166 Nancy Ridge Drive 

(C) CITY :\ San Diego 

(D) STATE t CA 

15 (E) COUNTR*: USA 

(F) ZIP: 92121 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

25 (C) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. 

(B) REGISTRATION NUMBER; 34,787 

(ix) TELECOMMUNICATION INFORMATION: 
30 (A) TELEPHONE : (858)453-7200 

(B) TELEFAX: (858)453-7210 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1260 base pairs 

35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
40 ATGGTCTTCT CGGCAGTGTT GACTGCGTTC CATACCGGGA CATCCAACAC AACATTTGTC 60 
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gtgtatgA&a acacctacat gaatattaca ctccctccac cattccagca TCCTGACCTC 12 0 

AGTCCATTGC TTAGATATAG TTTTGAAAC C ATGGCTCCCA CTGGTTTGAG TTCCTTGACC 180 

\ 

GTGAATAGTA CAGCTGTGCC CACAACACCA GCAGCATTTA AG AG C CT AAA CTTGCCTCTT 240 

\ 

CAGATCACCC TTTvCTGCTAT AATGATATTC ATT CTGTTTG TGTCTTTTCT TGGGAACTTG 3 00 
5 GTTGTTTGCC TCAT^GTTTA C CAAAAAGCT GCCATGAGGT CTGCAATTAA CATCCTCCTT 3 60 
GCCAGCCTAG CTTTTGCAGA CATGTTGCTT GCAGTGCTGA ACATGCCCTT TGCCCTGGTA 42 0 
ACTATTCTTA CTACCCGATG GATTTTTGGG AAATTCTTCT GTAGGGTATC TGCTATGTTT 480 
TTCTGGTTAT TTGTGATAGA AGGAGTAGCC ATCCTGCTCA TCATTAGCAT AGATAGGTTC 54 0 
CTTATTATAG TCCAGAGGCA GGATAAGCTA AACCCATATA GAGCTAAGGT TCTGATTGCA 600 

10 GTTTCTTGGG C AACTTC CTT TTGTGTAGCT TTTC CTTTAG CCGTAGGAAA CCCCGACCTG 660 
CAGATACCTT CCCGAGCTCC CCAGTGTGTG TTTGGGTACA CAACCAATCC AGGCT AC CAG 72 0 
GCTTATGTGA TTTTGATTTC TCTCATTTCT TT CTT CAT AC CCTTCCTGGT AATACTGTAC 780 
TCATTTATGG GCATACTCAA CACCCTTCGG CACAATGC CT TGAGGATCCA TAGCTAC C CT 84 0 
GAAGGTATAT GCCTCAGCCA GGCCAGCAAA CTGGGTCTCA TGAGTCTGCA GAG AC CTTTC 90 0 

15 CAGATGAGCA TTGACATGGG CTTTAAAACA CGTGCCTT CA CCACTATTTT GATTCTCTTT 96 0 

GCTGTCTTCA TTGTCTGCTG GGCCCCATTC ACCACTTACA GCCTTGTGGC 
AACATTCAGTl 0 2 0 

AAGCACTTTT ACTATCAGCA CAACTTTTTT GAG ATT AG C A CCTGGCTACT GTGGCTCTGC 1080 

T AC CTCAAGT CTGCATTGAA TCCGCTGATC TACTACTGGA GGATTAAGAA ATTC C ATG AT 114 0 

20 GCTTGCCTGG ACATGATGCC TAAGTCCTTC AAGTTTTTGC CGCAGCTCCC TGGTCACACA12 0 0 

AAGCGACGGA TACGTCCTAG TGCTGTCTAT GTGTGTGGGG AACATCGGAC GGTGGTGTGA1 2 60 

(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

30 Met Val Phe Ser Ala Val Leu Thr Ala Phe His Thr Gly Thr Ser Asn 

15 10 15 
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^ Thr Thr Phe Val Val Tyr Glu Asn Thr Tyr Met Asn lie Thr Leu Pro 

I 1 r 20 25 30 

Pro Pro Phe Gin His Pro Asp Leu Ser Pro Leu Leu Arg Tyr Ser Phe 

i 3 5 4 0 4 5 

i 

5 Glu Thr Met Ala Pro Thr Gly Leu Ser Ser Leu Thr Val Asn Ser Thr 

50 55 60 

Ala Val Pro Thr Thr Pro Ala Ala Phe Lys Ser Leu Asn Leu Pro Leu 

i 65 70 75 80 

Gin lie Thr Leu Ser Ala lie Met lie Phe lie Leu Phe Val Ser Phe 
10 85 90 95 

Leu Gly Asn Leu Val Val Cys Leu Met Val Tyr Gin Lys Ala Ala Met 
100 105 110 



Arg Ser Ala lie Asn lie Leu Leu Ala Ser Leu Ala Phe Ala Asp Met 

115 120 125 

15 Leu Leu Ala Val Leu Asn Met Pro Phe Ala Leu Val Thr lie Leu Thr 

130 135 140 

Thr Arg Trp lie Phe Gly Lys Phe Phe Cys Arg Val Ser Ala Met Phe 

145 150 155 160 

Phe Trp Leu Phe Val lie Glu Gly Val Ala lie Leu Leu lie lie Ser 
20 165 170 175 

lie Asp Arg Phe Leu lie lie Val Gin Arg Gin Asp Lys Leu Asn Pro 
180 185 190 

Tyr Arg Ala Lys Val Leu lie Ala Val Ser Trp Ala Thr Ser Phe Cys 
195 200 205 

25 Val Ala Phe Pro Leu Ala Val Gly Asn Pro Asp Leu Gin He Pro Ser 

210 215 220 

Arg Ala Pro Gin Cys Val Phe Gly Tyr Thr Thr Asn Pro Gly Tyr Gin 
225 230 235 240 

Ala Tyr Val He Leu He Ser Leu He Ser Phe Phe He Pro Phe Leu 

30 245 250 255 

Val He Leu Tyr Ser Phe Met Gly He Leu Asn Thr Leu Arg His Asn 

260 265 270 

Ala Leu Arg He His Ser Tyr Pro Glu Gly He Cys Leu Ser Gin Ala 
275 280 285 

35 Ser Lys Leu Gly Leu Met Ser Leu Gin Arg Pro Phe Gin Met Ser He 

290 295 300 



Asp Met Gly Phe Lys Thr Arg Ala Phe Thr Thr He Leu He Leu Phe 
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305 310 315 320 

Ala Val Phe lie Val Cys Trp Ala Pro Phe Thr Thr Tyr Ser Leu Val 
325 330 335 

Ala Thr Phe Ser Lys His Phe Tyr Tyr Gin His Asn Phe Phe Glu lie 
5 340 345 350 

Ser Thr Trp Leu Leu Trp Leu Cys Tyr Leu Lys Ser Ala Leu Asn Pro 
355 360 365 

Leu lie Tyr Tyr Trp Arg lie Lys Lys Phe His Asp Ala Cys Leu Asp 
370 375 380 

10 Met Met Pro Lys Ser Phe Lys Phe Leu Pro Gin Leu Pro Gly His Thr 

385 390 395 400 

Lys Arg Arg lie Arg Pro Ser Ala Val Tyr Val Cys Gly Glu His Arg 
405 410 415 



Thr Val Val 



15 



(4) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGTTAGCCA ACAGCTCCTC AACCAACAGT TCTGTTCTCC CGTGTCCTGA CTACCGACCT 60 

25 ACCCACCGCC TGCACTTGGT GGTCTACAGC TTGGTGCTGG CTGCCGGGCT CCCCCTCAAC 12 0 
GCGCTAGCCC TCTGGGTCTT CCTGCGCGCG CTGCGCGTGC ACTCGGTGGT GAGCGTGTAC 180 
ATGTGTAACC TGGCGGCCAG CGACCTGCTC TTCACCCTCT CGCTGCCCGT TCGTCTCTCC 24 0 
TACTACGCAC TGCAC CACTG GCCCTTCCCC GACCTCCTGT GCCAGACGAC GGGCGCCATC 3 00 
TTCCAGATGA ACATGTACGG CAGCTGCATC TTCCTGATGC TCATCAACGT GGACCGCTAC 3 60 

30 GCCGCCATCG TGCACCCGCT GCGACTGCGC CACCTGCGGC GGCCCCGCGT GGCGCGGCTG 42 0 
CTCTGCCTGG GCGTGTGGGC GCTCATCCTG GTGTTTGCCG TGCCCGCCGC CCGCGTGCAC 480 
AGGCCCTCGC GTTGCCGCTA CCGGGACCTC GAGGTGCGCC TATGCTTCGA GAGCTTCAGC 54 0 
GACGAGCTGT GGAAAGGCAG GCTGCTGCCC CTCGTGCTGC TGGCCGAGGC GCTGGGCTTC 600 
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CTGCTGCCCC TGGCGGCGGT GGTCTACTCG TCGGGCCGAG TCTTCTGGAC GCTGGCGCGC 66 0 
CCCGACGCCA CGCAGAGCCA GCGGCGGCGG AAGACCGTGC GCCTCCTGCT GGCTAACCTC 72 0 
GTCATCTTCC TGCT^GTGCTT CGTGCCCTAC AACAGCACGC TGGCGGTCTA CGGGCTGCTG 780 
CGGAGCAAGC TGGTGtJCGGC CAGCGTGCCT GCCCGCGATC GCGTGCGCGG GGTGCTGATG 84 0 
5 GTGATGGTGC TGCTGGC^GG CGCCAACTGC GTGCTGGACC CGCTGGTGTA CTACTTTAGC 90 0 
GCCGAGGGCT TCCGCAACAC CCTGCGCGGC CTGGGCACTC CGCACCGGGC CAGGACCTCG 960 
GCCACCAACG GGACGCGGGC GGCGCTCGCG CAATCCGAAA GGTCCGCCGT CACCACCGAC 102 0 
GCCACCAGGC CGGATGCCGC CAGTCAGGGG CTGCTCCGAC CCTCCGACTC CCACTCTCTG 10 8 0 
TCTTC CTTCA CACAGTGTCC CCAGGATTCC GCCCTCTGA 1119 
10 (5) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 72 amino acids 

( B ) TYPE : amino ac id 

( C ) STRANDEDNESS : 

15 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Leu Ala Asn Ser Ser Ser Thr Aan Ser Ser Val Leu Pro Cys Pro 
15 10 15 

20 Asp Tyr Arg Pro Thr His Arg Leu His Leu Val Val Tyr Ser Leu Val 

20 25 30 

Leu Ala Ala Gly Leu Pro Leu Asn Ala Leu Ala Leu Trp Val Phe Leu 
35 40 45 

Arg Ala Leu Arg Val His Ser Val Val Ser Val Tyr Met Cys Asn Leu 
25 50 55 60 

Ala Ala Ser Asp Leu Leu Phe Thr Leu Ser Leu Pro Val Arg Leu Ser 

65 70 75 80 

Tyr Tyr Ala Leu His His Trp Pro Phe Pro Asp Leu Leu Cys Gin Thr 
85 90 95 

30 Thr Gly Ala lie Phe Gin Met Asn Met Tyr Gly Ser Cys He Phe Leu 

100 105 HO 

Met Leu He Asn Val Asp Arg Tyr Ala Ala He Val His Pro Leu Arg 
115 120 125 



AREN-0050 - 36 - PATENT 

Leu Arg His Leu Arg Arg Pro Arg Val Ala Arg Leu Leu Cys Leu Gly 
130 135 140 

Val Trp Ala Leu lie Leu Val Phe Ala Val Pro Ala Ala Arg Val His 
145 \ 150 155 160 

5 Arg Pro Ser Arg Cys Arg Tyr Arg Asp Leu Glu Val Arg Leu Cys Phe 

165 170 175 

Glu Ser Phe Ser Asp Glu Leu Trp Lys Gly Arg Leu Leu Pro Leu Val 
180 185 190 

Leu Leu Ala Glu Ala Leu Gly Phe Leu Leu Pro Leu Ala Ala Val Val 
10 195 200 205 

Tyr Ser Ser Gly Arg Val Phe Trp Thr Leu Ala Arg Pro Asp Ala Thr 
210 215 220 

Gin Ser Gin Arg Arg Arg Lys Thr Val Arg Leu Leu Leu Ala Asn Leu 
225 230 235 240 

15 Val lie Phe Leu Leu Cys Phe Val Pro Tyr Asn Ser Thr Leu Ala Val 

245 250 255 

Tyr Gly Leu Leu Arg Ser Lys Leu Val Ala Ala Ser Val Pro Ala Arg 
260 265 270 

Asp Arg Val Arg Gly Val Leu Met Val Met Val Leu Leu Ala Gly Ala 
20 275 280 285 

Asn Cys Val Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ala Glu Gly Phe 
290 295 300 

Arg Asn Thr Leu Arg Gly Leu Gly Thr Pro His Arg Ala Arg Thr Ser 
305 310 315 320 

25 Ala Thr Asn Gly Thr Arg Ala Ala Leu Ala Gin Ser Glu Arg Ser Ala 

325 330 335 

Val Thr Thr Asp Ala Thr Arg Pro Asp Ala Ala Ser Gin Gly Leu Leu 
340 345 350 

Arg Pro Ser Asp Ser His Ser Leu Ser Ser Phe Thr Gin Cys Pro Gin 
30 355 360 365 

Asp Ser Ala Leu 
370 

(6) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
( D } TOPOLOGY : linear 
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(j.i) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGCCAACT CCACAGGGCT GAACGCCTCA GAAGTCGCAG GCTCGTTGGG GTTGATCCTG 6 0 
GCAGCTGTCG TGGAGGTGGG GGCACTGCTG GGCAACGGCG CGCTGCTGGT CGTGGTGCTG 12 0 
5 CGCACGCCGG GACTGCGCGA CGCGCTCTAC CTGGCGCACC TGTGCGTCGT GGACCTGCTG 18 0 
GCGGCCGCCT CCATCATGCC GCTGGGCCTG CTGGCCGCAC CGCCGCCCGG GCTGGGCCGC 240 
GTGCGCCTGG GCCCCGCGCC ATGCCGCGCC GCTCGCTTCC TCTCCGCCGC TCTGCTGCCG 3 00 
GCCTGCACGC TCGGGGTGGC CGCACTTGGC CTGGCACGCT ACCGCCTCAT CGTGCACCCG 3 60 
CTGCGGCCAG GCTCGCGGCC GCCGCCTGTG CTCGTGCTCA CCGCCGTGTG GGCCGCGGCG 42 0 

1 0 GGACTGCTGG GCGCGCTCTC CCTGCTCGGC CCGCCGCCCG CACCGCCCCC TGCTCCTGCT 4 80 
CGCTGCTCGG TCCTGGCTGG GGGCCTCGGG CCCTTCCGGC CGCTCTGGGC CCTGCTGGCC 54 0 
TTCGCGCTGC CCGCCCTCCT GCTGCTCGGC GCCTACGGCG GCATCTTCGT GGTGGCGCGT 600 
CGCGCTGCCC TGAGGCCCCC ACGGCCGGCG CGCGGGTCCC GACTCCGCTC GGACT CTCTG 660 
GATAGCCGCC TTTCCATCTT GCCGCCGCTC CGGCCTCGCC TGCCCGGGGG CAAGGCGGCC 72 0 

15 CTGGCCCCAG CGCTGGCCGT GGGCCAATTT GCAGCCTGCT GGCTGCCTTA TGGCTGCGCG 7 80 
TGCCTGGCGC CCGCAGCGCG GGCCGCGGAA GCCGAAGCGG CTGTCACCTG GGTCGCCTAC 84 0 
TCGGCCTTCG CGGCTCACCC CTTCCTGTAC GGGCTGCTGC AGCGCCCCGT GCGCTTGGCA 900 
CTGGGCCGCC TCTCTCGCCG TGCACTGCCT GGACCTGTGC GGGCCTGCAC TCCGCAAGCC 96 0 
TGGCACCCGC GGGCACTCTT GCAATGCCTC CAGAGACCCC CAGAGGGCCC TGCCGTAGGC102 0 

20 CCTTCTGAGG CTCCAGAACA GACCCCCGAG TTGGCAGGAG GGCGGAGCCC CGCATACCAG10 8 0 
GGGCCACCTG AGAGTTCTCT CTCCTGA 1107 
(7) INFORMATION FOR SEQ ID NO : 6 : 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 68 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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Met Ala Asn Ser Thr Gly Leu Asn Ala Ser Glu Val Ala Gly Ser Leu 
15 10 15 

Gly Leu He Leu Ala Ala Val Val Glu Val Gly Ala Leu Leu Gly Asn 
20 25 30 

5 Gly Ala Leu Leu Val Val Val Leu Arg Thr Pro Gly Leu Arg Asp Ala 

35 40 45 

Leu Tyr Leu Ala H^Ls Leu Cys Val Val Asp Leu Leu Ala Ala Ala Ser 
50 55 60 

lie Met Pro Leu Gly Leu Leu Ala Ala Pro Pro Pro Gly Leu Gly Arg 
10 65 70 75 80 

Val Arg Leu Gly Pro Ala Pro Cys Arg Ala Ala Arg Phe Leu Ser Ala 
85 90 95 

Ala Leu Leu Pro Ala Cys Thr Leu Gly Val Ala Ala Leu Gly Leu Ala 
100 105 110 

15 Arg Tyr Arg Leu He Val His Pro Leu Arg Pro Gly Ser Arg Pro Pro 

115 120 125 

Pro Val Leu Val Leu Thr Ala Val Trp Ala Ala Ala Gly Leu Leu Gly 
130 135 140 

Ala Leu Ser Leu Leu Gly Pro Pro Pro Ala Pro Pro Pro Ala Pro Ala 
20 145 150 155 160 

Arg Cys Ser Val Leu Ala Gly Gly Leu Gly Pro Phe Arg Pro Leu Trp 
165 170 175 

Ala Leu Leu Ala Phe Ala Leu Pro Ala Leu Leu Leu Leu Gly Ala Tyr 
180 185 190 

25 Gly Gly He Phe Val Val Ala Arg Arg Ala Ala Leu Arg Pro Pro Arg 

195 200 205 

Pro Ala Arg Gly Ser Arg Leu Arg Ser Asp Ser Leu Asp Ser Arg Leu 
210 215 220 

Ser He Leu Pro Pro Leu Arg Pro Arg Leu Pro Gly Gly Lys Ala Ala 
30 225 230 235 240 

Leu Ala Pro Ala Leu Ala Val Gly Gin Phe Ala Ala Cys Trp Leu Pro 
245 250 255 

Tyr Gly Cys Ala Cys Leu Ala Pro Ala Ala Arg Ala Ala Glu Ala Glu 
260 265 270 

35 Ala Ala Val Thr Trp Val Ala Tyr Ser Ala Phe Ala Ala His Pro Phe 

275 280 285 



Leu Tyr Gly Leu Leu Gin Arg Pro Val Arg Leu Ala Leu Gly Arg Leu 
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2^0 295 300 

Ser Arg Arg Ala Leu Pro Gly Pro Val Arg Ala Cys Thr Pro Gin Ala 
305 \ 310 315 320 

\ 

Trp His PrdXArg Ala Leu Leu Gin Cys Leu Gin Arg Pro Pro Glu Gly 
5 \ 325 330 335 

Pro Ala Val Gly Pro Ser Glu Ala Pro Glu Gin Thr Pro Glu Leu Ala 
340\ 345 350 

Gly Gly Arg Ser P*:o Ala Tyr Gin Gly Pro Pro Glu Ser Ser Leu Ser 
355 360 365 

10 (8) INFORMATION FOR SEQ I'D NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGGAATCAT CTTTCTCATT TGGAGTGATC CTTGCTGTCC TGGCCTCCCT CATCATTGCT 60 

ACTAACACAC TAGTGGCTGT GGCTGTGCTG CTGTTGATCC ACAAGAATGA TGGTGTCAGT 12 0 

20 CTCTGCTTCA CCTTGAATCT GGCTGTGGCT GACACCTTGA TTGGTGTGGC CATCTCTGGC 180 

CTACTCACAG ACCAGCTCTC CAGCCCTTCT CGGCCCACAC AGAAGAC CCT GTGCAGCCTG 24 0 

CGGATGGCAT TTGTCACTTC CTCCGCAGCT GCCTCTGTCC TCACGGTCAT GCTGATCACC 3 00 

TTTGACAGGT ACCTTGCCAT CAAGCAGCCC TTCCGCTACT TOAAGATCAT GAGTGGGTTC 360 

GTGGCCGGGG CCTGCATTGC CGGGCTGTGG TTAGTGTCTT ACCTCATTGG CTTCCTCCCA 42 0 

25 CTCGGAATCC CCATGTTCCA GCAGACTGCC TACAAAGGGC AGTGCAGCTT CTTTGCTGTA 4 8 0 

TTTCACCCTC ACTTCGTGCT GACCCTCTCC TGCGTTGGCT TCTTCCCAGC CATGCTCCTC 54 0 

TTTGTCTTCT TCTACTGCGA CATG CTCAAG ATTGCCT CCA TGCACAGCCA GCAGATTCGA 600 

AAGATGGAAC ATGCAGGAGC CATGGCTGGA GGTTATCGAT CCCCACGGAC TCCCAGCGAC 660 

TTCAAAGCTC TCCGTACTGT GTCTGTTCTC ATTGGGAGCT TTGCTCTATC CTGGACCCCC 72 0 

30 TTCCTTATCA CTGGCATTGT GCAGGTGGCC TGCCAGGAGT GTCACCTCTA CCTAGTGCTG 780 

G AACGGT AC C TGTGGCTGCT CGGCGTGGGC AACTCCCTGC TCAACCCACT CATCTATGCC 84 0 
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TATTGGCAGA AGGAGGTGCG ACTGCAGCTC TACCACATGG CCCTAGGAGT GAAGAAGGTG 90 0 
CTCACCTCAT TCCTCCTCTT TCTCTCGGCC AGGAATTGTG G C C C AG AG AG GCCCAGGGAA 96 0 
, AGTTCCTGTCNACATCGTCAC TATCTCCAGC TCAGAGTTTG ATGGCTAA 10 08 

(9) INFORMATION FOR SEQ ID NO : 8 : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 5 amino acids 

(B) TYPJE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

10 (ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Glu Ser Ser Phe Ser Phe Gly Val lie Leu Ala Val Leu Ala Ser 
15 10 15 

Leu lie lie Ala Thr Asn Thr Leu Val Ala Val Ala Val Leu Leu Leu 
15 20 25 30 

lie His Lys Asn Asp Gly Val Ser Leu Cys Phe Thr Leu Asn Leu Ala 
35 40 45 

Val Ala Asp Thr Leu lie Gly Val Ala lie Ser Gly Leu Leu Thr Asp 
50 55 60 

20 Gin Leu Ser Ser Pro Ser Arg Pro Thr Gin Lys Thr Leu Cys Ser Leu 

65 70 75 80 

Arg Met Ala Phe Val Thr Ser Ser Ala Ala Ala Ser Val Leu Thr Val 
85 90 95 

Met Leu lie Thr Phe Asp Arg Tyr Leu Ala He Lys Gin Pro Phe Arg 
25 100 105 110 

Tyr Leu Lys He Met Ser Gly Phe Val Ala Gly Ala Cys He Ala Gly 
115 120 125 

Leu Trp Leu Val Ser Tyr Leu He Gly Phe Leu Pro Leu Gly He Pro 
130 135 140 

30 Met Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cys Ser Phe Phe Ala Val 

145 150 155 160 

Phe His Pro His Phe Val Leu Thr Leu Ser Cys Val Gly Phe Phe Pro 
165 170 175 

Ala Met Leu Leu Phe Val Phe Phe Tyr Cys Asp Met Leu Lys He Ala 

35 180 185 190 
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Ser Met H^s Ser Gin Gin lie Arg Lys Met Glu His Ala Gly Ala Met 
1S»5 200 205 

Ala Gly Gly .Tyr Arg Ser Pro Arg Thr Pro Ser Asp Phe Lys Ala Leu 
210 \ 215 220 

5 Arg Thr Val Se*r Val Leu lie Gly Ser Phe Ala Leu Ser Trp Thr Pro 

225 230 235 240 

Phe Leu lie Thr <?ly lie Val Gin Val Ala Cys Gin Glu Cys His Leu 
2^5 2 50 255 

Tyr Leu Val Leu Glu Arg Tyr Leu Trp Leu Leu Gly Val Gly Asn Ser 
10 260 265 270 

Leu Leu Asn Pro Leu Tie Tyr Ala Tyr Trp Gin Lys Glu Val Arg Leu 
275 280 285 

Gin Leu Tyr His Met Ala Leu Gly Val Lys Lys Val Leu Thr Ser Phe 
290 295 300 

15 Leu Leu Phe Leu Ser Ala Arg Asn Cys Gly Pro Glu Arg Pro Arg Glu 

305 310 315 320 

Ser Ser Cys His lie Val Thr lie Ser Ser Ser Glu Phe Asp Gly 

325 330 335 

(10) INFORMATION FOR SEQ ID NO : 9 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1413 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGGACACTA C C ATGGAAGC TGACCTGGGT GCCACTGGCC ACAGGCCCCG CACAGAGCTT 60 

GATGATGAGG ACTCCTACCC CCAAGGTGGC TGGGACACGG TCTTCCTGGT GGCCCTGCTG 12 0 

CTCCTTGGGC TGCCAGCCAA TGGGTTGATG GCGTGGCTGG CCGGCTCCCA GGCCCGGCAT 18 0 

30 GGAGCTGGCA CGCGTCTGGC GCTGCTCCTG CTCAGCCTGG CCCTCTCTGA CTTCTTGTTC 24 0 

CTGGCAGCAG CGGCCTTCCA GATCCTAGAG ATCCGGCATG GGGGACACTG GCCGCTGGGG 3 00 

ACAGCTGCCT GCCGCTTCTA CTACTTCCTA TGGGGCGTGT CCTACTCCTC CGGCCTCTTC 3 60 

CTGCTGGCCG CCCTCAGCCT CGACCGCTGC CTGCTGGCGC TGTGCCCACA CTGGTACCCT 420 

GGGCACCGCC CAGTCCGCCT GCCCCTCTGG GTCTGCGCCG GTGTCTGGGT GCTGGCCACA 4 80 
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CTCTTCAGCG TGCCCTGGCT GGTCTTCCCC GAGGCTGCCG TCTGGTGGTA CGACCTGGTC 54 0 
AT C TGC CTGG ACTTCTGGGA CAGCGAGGAG CTGTCGCTGA GGATGCTGGA GGTCCTGGGG 6 00 
GGCTTCCTGC CTTTCCTCCT GCTGCTCGTC TGCCACGTGC TCACCCAGGC CACAGCCTGT 6 60 
CGCACCTGCC ACCGCCAACA GCAGCCCGCA GCCTGCCGGG GCTTCGCCCG TGTGGCCAGG 72 0 
5 ACCATTCTGT CAGCCTATGT GGTCCTGAGG CTGCCCTACC AGCTGGC C C A GCTGCTCTAC 7 80 
CTGGCCTTCC TGTGGGACGT CTACTCTGGC TACCTGCTCT GGGAGGCCCT GGTCT ACT C C 84 0 
GACTACCTGA TCCTACTCAA CAGCTGCCTC AGCCCCTTCC TCTGC CTCAT GGCCAGTGCC 90 0 
GACCTCCGGA CCCTGCTGCG CTCCGTGCTC TCGTCCTTCG CGGCAGCTCT CTGCGAGGAG 960 
CGGCCGGGCA GCTTCACGCC CACTGAGCCA CAGACCCAGC TAGATTCTGA GGGTCCAACT1 0 2 0 

10CTGCCAGAGC CGATGGCAGA GGCCCAGTCA CAGATGGATC CTGTGGCCCA GCCTCAGGTG1 08 0 
AACCCCACAC TCCAGCCACG ATCGGATCCC ACAGCTCAGC CACAGCTGAA CCCTACGGCC114 0 
CAGC CACAGT CGGATCCCAC AGCCCAGCCA CAGCTGAACC TCATGGC C C A GCCACAGTCA1 2 0 0 
GATTCTGTGG CCCAGCCACA GGCAGACACT AACGTCCAGA CCCCTGCACC TGCTGCCAGT126 0 
TCTGTGCCCA GTCCCTGTGA TGAAGCTTCC CCAACCCCAT CCTCGCATCC TACCCCAGGG1 3 2 0 

1 5 GCCCTTGAGG ACCCAGCCAC ACCTCCTGCC TCTGAAGGAG AAAGCC C C AG CAGCACCCCG1 3 8 0 
CCAGAGGCGG CCCCGGGCGC AGGCCCCACG TGA 1413 
(11) INFORMATION FOR SEQ ID NO: 10: 



20 



<i) 



SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 468 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: not relevant 



(ii) 



MOLECULE TYPE: protein 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 10 



25 



Met 



1 



Asp Thr Thr Met Glu Ala Asp Leu Gly Ala Thr Gly His Arg Pro 

5 10 15 



Arg 



Thr Glu Leu Asp Asp Glu Asp Ser Tyr Pro Gin Gly Gly Trp Asp 
20 25 30 



30 



Thr 



Val Phe Leu Val Ala Leu Leu Leu Leu Gly Leu Pro Ala Asn Gly 
35 40 45 



Leu 



Met Ala Trp Leu Ala Gly Ser Gin Ala Arg His Gly Ala Gly Thr 
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f • \ 5 0 5 5 6 0 

Arg Lev Ala Leu Leu Leu Leu Ser Leu Ala Leu Ser Asp Phe Leu Phe 
65 70 75 80 

Leu Ala Ala Ala Ala Phe Gin He Leu Glu He Arg His Gly Gly His 
5 85 90 95 

Trp Pro Leu Gly Thr Ala Ala Cys Arg Phe Tyr Tyr Phe Leu Trp Gly 
106 105 110 

Val Ser Tyr Ser S^er Gly Leu Phe Leu Leu Ala Ala Leu Ser Leu Asp 
115 120 125 

10 Arg Cys Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly His Arg Pro 

130 135 140 

Val Arg Leu Pro Leu Trp Val Cys Ala Gly Val Trp Val Leu Ala Thr 
145 150 155 160 

Leu Phe Ser Val Pro Trp Leu Val Phe Pro Glu Ala Ala Val Trp Trp 
15 165 170 175 

Tyr Asp Leu Val He Cys Leu Asp Phe Trp Asp Ser Glu Glu Leu Ser 
180 185 190 

Leu Arg Met Leu Glu Val Leu Gly Gly Phe Leu Pro Phe Leu Leu Leu 
195 200 205 

20 Leu Val Cys His Val Leu Thr Gin Ala Thr Arg Thr Cys His Arg Gin 

210 215 220 



Gin Gin Pro Ala 
225 

Leu Ser Ala Tyr 

25 

Leu Tyr Leu Ala 
260 

Glu Ala Leu Val 
275 

30 Ser Pro Phe Leu 

290 

Arg Ser Val Leu 
305 

Gly Ser Phe Thr 

35 



Ala Cys Arg Gly Phe Ala 
230 

Val Val Leu Arg Leu Pro 
245 250 

Phe Leu Trp Asp Val Tyr 
265 

Tyr Ser Asp Tyr Leu He 
280 

Cys Leu Met Ala Ser Ala 
295 

Ser Ser Phe Ala Ala Ala 
310 

Pro Thr Glu Pro Gin Thr 
325 330 



Arg Val Ala Arg Thr He 
235 240 

Tyr Gin Leu Ala Gin Leu 

255 

Ser Gly Tyr Leu Leu Trp 
270 

Leu Leu Asn Ser Cys Leu 
285 

Asp Leu Arg Thr Leu Leu 
300 

Leu Cys Glu Glu Arg Pro 
315 320 

Gin Leu Asp Ser Glu Gly 
335 



Pro Thr Leu Pro Glu Pro Met Ala Glu Ala Gin Ser Gin Met Asp Pro 
340 345 350 
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Val Ala Gin Pro Gin Val Asn Pro Thr Leu Gin Pro Arg Ser Asp Pro 
355 360 365 

Thr Ala Gin Pro Gin Leu Asn Pro Thr Ala Gin Pro Gin Ser Asp Pro 
370 375 380 

5 Thr Ala Gin Pro Gin Leu Asn Leu Met Ala Gin Pro Gin Ser Asp Ser 

385 390 395 400 

Val Ala Gin fro Gin Ala Asp Thr Asn Val Gin Thr Pro Ala Pro Ala 
\ 405 410 415 

\ 

Ala Ser Ser Val\ Pro Ser Pro Cys Asp Glu Ala Ser Pro Thr Pro Ser 
10 420 425 430 

Ser His Pro Thr Pro Gly Ala Leu Glu Asp Pro Ala Thr Pro Pro Ala 
435 440 445 

Ser Glu Gly Glu Ser Pro Ser Ser Thr Pro Pro Glu Ala Ala Pro Gly 
450 455 460 

15 Ala Gly Pro Thr 

465 

(12) INFORMATION FOR SEQ ID NO; 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 8 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

25 ATGTCAGGGA TGGAAAAACT TCAGAATGCT TCCTGGATCT ACCAGCAGAA ACTAGAAGAT 60 
C CATTCCAG A AACACCTGAA CAGCACCGAG GAGTATCTGG CCTTCCTCTG CGGACCTCGG 12 0 
CGCAGCCACT TCTTCCTCCC CGTGTCTGTG GTGTATGTGC CAATTTTTGT GGTGGGGGTC 18 0 
ATTGGCAATG TCCTGGTGTG C CTGGTG ATT CTGCAGCACC AGGCTATGAA GACGCCCACC 24 0 
AACTACTACC TCTTCAGCCT GGCGGTCTCT GACCTCCTGG TCCTGCTCCT TGGAATGCCC 300 

30 CTGGAGGTCT ATGAGATGTG GCGCAACTAC CCTTTCTTGT TCGGGCCCGT GGGCTGCTAC 3 60 
TTCAAGACGG CCCTCTTTGA GACCGTGTGC TTCGCCTCCA TCCTCAGCAT CACCACCGTC 42 0 
AGCGTGGAGC GCTACGTGGC CATCCTACAC CCGTTCCGCG CCAAACTGCA GAGCACCCGG 4 80 
CGCCGGGCCC TCAGGATCCT CGGCATCGTC TGGGGCTTCT CCGTGCTCTT CTCCCTGCCC 54 0 
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AACACCAGGA TCCATGGCAT CAAGTTCCAC TACTTCCCCA ATGGGTCCCT GGTCCCAGGT 600 

TCGGCCACCT GTACGGTCAT CAAGCCCATG TGG AT CT AC A ATTTCATCAT CCAGGTCACC 66 0 

TCCTTCCTAT T^TACCTCCT CCCCATGACT GTCATCAGTG TCCTCTACTA CCTCATGGCA 72 0 

CTCAGACTAA AGAAAGACAA ATCTCTTGAG GCAGATGAAG GGAATG C AAA TATTCAAAGA 7 80 

5 CCCTGCAGAA AATCAGTCAA CAAGATGCTG TTTGTCTTGG TCTTAGTGTT TGCTATCTGT 84 0 

TGGGCCCCGT TCCACAT>TGA CCGACTCTTC TTCAGCTTTG TGGAGGAGTG GAGTGAATCC 90 0 

CTGGCTGCTG TGTTCAACCT CGTCCATGTG GTGTCAGGTG TCTTCTTCTA CCTGAGCTCA 96 0 

GCTGTCAACC CCATTATCTA TAACCTACTG TCTCGCCGCT TCCAGGCAGC ATTCCAGAAT 102 0 

GTGATCTCTT CTTTCCACAA ACAGTGGCAC TCCCAGCATG ACCCACAGTT GCCACCTGCC1 0 8 0 

1 0 CAGCGGAACA TCTTCCTGAC AGAATGCCAC TTTGTGGAGC TGACCGAAGA TATAGGTCCC114 0 

CAATTCCCAT GTCAGTCATC CATGCACAAC TCTCACCTCC CAACAGCCCT CTCTAGTGAA1 200 

CAGATGTCAA GAACAAACTA TCAAAGCTTC CACTTTAACA AAACCTGA 124 8 

(13) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS; 
15 (A) LENGTH: 415 amino acids 

(B) TYPE : amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

25 Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 

35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr 
30 65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 
85 90 95 
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Leu Gly Met Pro 

100 

Leu Phe Gly Pro 
115 

5 Val Cys Phe Ala 

130 

Tyr Val Ala lie 
145 

Arg Arg Ala Leu 

10 

Phe Ser Leu Pro 
180 

Pro Asn Gly Ser 

195 

15 Pro Met Trp lie 

210 

Tyr Leu Leu Pro 
225 

Leu Arg Leu Lys 

20 

Asn lie Gin Arg 
260 

Leu Val Leu Val 
275 

25 Leu Phe Phe Ser 

290 

Phe Asn Leu Val 
305 

Ala Val Asn Pro 

30 

Ala Phe Gin Asn 
340 

His Asp Pro Gin 
355 

35 Cys His Phe Val 

370 

Gin Ser Ser Met 
385 
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Leu Glu Val Tyr Glu Met 
105 

Val Gly Cys Tyr Phe Lys 
120 

Ser lie Leu Ser lie Thr 

135 

Leu His Pro Phe Arg Ala 
150 

Arg lie Leu Gly He Val 
165 170 

Asn Thr Ser He His Gly 
185 

Leu Val Pro Gly Ser Ala 
200 

Tyr Asn Phe He He Gin 

215 

Met Thr Val He Ser Val 

230 

Lys Asp Lys Ser Leu Glu 
245 250 

Pro Cys Arg Lys Ser Val 
265 

Phe Ala He Cys Trp Ala 
280 

Phe Val Glu Glu Trp Ser 
295 

His Val Val Ser Gly Val 
310 

He He Tyr Asn Leu Leu 

325 330 

Val He Ser Ser Phe His 

345 

Leu Pro Pro Ala Gin Arg 
360 

Glu Leu Thr Glu Asp He 
375 

His Asn Ser His Leu Pro 
390 



PATENT 

Trp Arg Asn Tyr Pro Phe 

110 

Thr Ala Leu Phe Glu Thr 
125 

Thr Val Ser Val Glu Arg 
140 

Lys Leu Gin Ser Thr Arg 
155 160 

Trp Gly Phe Ser Val Leu 
175 

He Lys Phe His Tyr Phe 
190 

Thr Cys Thr Val He Lys 
205 

Val Thr Ser Phe Leu Phe 

220 

Leu Tyr Tyr Leu Met Ala 

235 240 

Ala Asp Glu Gly Asn Ala 
255 

Asn Lys Met Leu Phe Val 
270 

Pro Phe His He Asp Arg 
285 

Glu Ser Leu Ala Ala Val 
300 

Phe Phe Tyr Leu Ser Ser 
315 320 

Ser Arg Arg Phe Gin Ala 
335 

Lys Gin Trp His Ser Gin 
350 

Asn He Phe Leu Thr Glu 
365 

Gly Pro Gin Phe Pro Cys 
380 

Thr Ala Leu Ser Ser Glu 
395 400 
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\ 



Gin Met S^r Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
4 05 410 415 



( 14 ) INFORMATION \FOR SEQ ID NO : 13 : 



5 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1173 base pairs 

(B) TYPE:\ nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 



i 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ATGCCAGATA CTAATAGCAC AATCAATTTA TCACTAAGCA CTCGTGTTAC TTTAGCATTT 60 
TTTATGTCCT TAGTAGCTTT TGCTATAATG CTAGGAAATG CTTTGGT CAT TTTAGCTTTT 120 
i GTGGTGGACA AAAACCTTAG ACATCGAAGT AGTTATTTTT TTCTTAACTT GGCCATCTCT 180 
1 GACTTCTTTG TGGGTGTGAT CTCCATTCCT TTGTACATCC CTCACACGCT GTTCGAATGG 24 0 
15 GATTTTGGAA AGGAAAT CTG TGTATTTTGG CTCACTACTG ACT AT CTGTT ATGTACAGCA 3 00 
TCTGTATATA ACATTGTCCT CATCAGCTAT GATCGATACC TGTCAGTCTC AAATGCTGTG 3 60 
TCTTATAGAA CTCAACATAC TGGGGTCTTG AAGATTGTTA CTCTGATGGT GGCCGTTTGG 42 0 
GTGCTGGCCT TCTTAGTGAA TGGGCCAATG ATTCTAGTTT CAGAGTCTTG GAAGGATGAA 4 80 
GGTAGTGAAT GTGAACCTGG ATTTTTTTCG GAATGGTACA TCCTTGCCAT CACATCATTC 540 
20 TTGGAATTCG TGATCCCAGT CATCTTAGTC GCTTATTTCA ACATGAATAT TTATTGGAGC 600 
CTGTGGAAGC GTGATCATCT CAGTAGGTGC CAAAGCCATC CTGGACTGAC TGCTGTCTCT 660 
TCCAACATCT GTGGACACTC ATTCAGAGGT AGACTATCTT CAAGGAGATC TCTTTCTGCA 720 
TCGACAGAAG TTCCTGCATC CTTTCATTCA GAGAGACAGA GGAGAAAGAG TAGTCTCATG 7 80 
TTTTCCTCAA G AAC CAAGAT GAATAGCAAT ACAATTGCTT CCAAAATGGG TTCCTTCTCC 84 0 
25 CAATCAGATT CTGTAGCTCT T C AC CAAAGG GAACATGTTG AACTGCTTAG AGCCAGGAGA 900 
TTAGCCAAGT CACTGGCCAT TCTCTTAGGG GTTTTTGCTG TTTGCTGGGC T C CAT ATT CT 960 
CTGTTCACAA TTGTC CTTTC ATTTTATTCC TCAGCAACAG GTCCTAAATC AGTTTGGTATl 02 0 
AGAATTGCAT TTTGGCTTCA GTGGTTCAAT TCCTTTGTCA ATCCTCTTTT GTATCCATTG1 08 0 
TGTCACAAGC GCTTTCAAAA GGCTTTCTTG AAAATATTTT GTATAAAAAA GCAACCTCTA1 14 0 
30 CCATCACAAC ACAGTCGGTC AGTATCTTCT TAA 117 3 
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\rf (15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Pro Asp Thr Asn Ser Thr lie Asn Leu Ser Leu Ser Thr Arg Val 
10 1 5 10 15 

Thr Leu Ala Phe Phe Met Ser Leu Val Ala Phe Ala lie Met Leu Gly 
20 25 30 

Asn Ala Leu Val lie Leu Ala Phe Val Val Asp Lys Asn Leu Arg His 
35 40 45 

15 Arg Ser Ser Tyr Phe Phe Leu Asn Leu Ala lie Ser Asp Phe Phe Val 

50 55 60 

Gly Val lie Ser lie Pro Leu Tyr lie Pro His Thr Leu Phe Glu Trp 
65 70 75 80 

Asp Phe Gly Lys Glu lie Cys Val Phe Trp Leu Thr Thr Asp Tyr Leu 
20 85 90 95 

Leu Cys Thr Ala Ser Val Tyr Asn lie Val Leu lie Ser Tyr Asp Arg 
100 105 110 

Tyr Leu Ser Val Ser Asn Ala Val Ser Tyr Arg Thr Gin His Thr Gly 
115 120 125 

25 Val Leu Lys lie Val Thr Leu Met Val Ala Val Trp Val Leu Ala Phe 

130 135 140 

Leu Val Asn Gly Pro Met lie Leu Val Ser Glu Ser Trp Lys Asp Glu 
145 150 155 160 

Gly Ser Glu Cys Glu Pro Gly Phe Phe Ser Glu Trp Tyr lie Leu Ala 
30 165 170 175 

He Thr Ser Phe Leu Glu Phe Val He Pro Val He Leu Val Ala Tyr 
180 185 190 

Phe Asn Met Asn He Tyr Trp Ser Leu Trp Lys Arg Asp His Leu Ser 
195 200 205 

35 Arg Cys Gin Ser His Pro Gly Leu Thr Ala Val Ser Ser Asn He Cys 

210 215 220 
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Gly His Ser Phe Arg Gly Arg Leu Ser Ser Arg Arg Ser Leu Ser Ala 
225 230 235 240 

Ser Thr Glu Val Pro Ala Ser Phe His Ser Glu Arg Gin Arg Arg Lys 
245 250 255 

5 Ser Ser Leu Met ^»he Ser Ser Arg Thr Lys Met Asn Ser Asn Thr lie 

260 265 270 

Ala Ser Lys Met Gly Ser Phe Ser Gin Ser Asp Ser Val Ala Leu His 
275 280 285 

Gin Arg Glu His Val Glu Leu Leu Arg Ala Arg Arg Leu Ala Lys Ser 
10 290 295 300 

Leu Ala lie Leu Leu QXy Val Phe Ala Val Cys Trp Ala Pro Tyr Ser 
305 3X0 315 320 

Leu Phe Thr lie Val Leu Ser Phe Tyr Ser Ser Ala Thr Gly Pro Lys 

325 330 335 

15 Ser Val Trp Tyr Arg lie Ala Phe Trp Leu Gin Trp Phe Asn Ser Phe 

340 345 350 

Val Asn Pro Leu Leu Tyr Pro Leu Cys His Lys Arg Phe Gin Lys Ala 
355 360 365 

Phe Leu Lys lie Phe Cys lie Lys Lys Gin Pro Leu Pro Ser Gin His 
20 370 375 380 

Ser Arg Ser Val Ser Ser 
385 390 

(16) INFORMATION FOR SEQ ID NO:15i 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

ATGGCGAACG CGAGCGAGCC GGGTGGCAGC GGCGGCGGCG AGGCGGCCGC CCTGGGCCTC 60 
AAGCTGGCCA CGCTCAGCCT GCTGCTGTGC GTGAGC CT AG CGGGCAACGT GCTGTTCGCG 12 0 
CTGCTGATCG TGCGGGAGCG CAGCCTGCAC CGCGCCCCGT ACTAC CTGCT GCTCGACCTG 180 
TGCCTGGCCG ACGGGCTGCG CGCGCTCGCC TGCCTCCCGG CCGTCATGCT GGCGGCGCGG 24 0 

35 CGTGCGGCGG CCGCGGCGGG GGCGCCGCCG GGCGCGCTGG GCTGCAAG CT GCTCGCCTTC 3 00 
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CTGGCCGCGC TCTTCTGCTT CCACGCCGCC TTCCTGCTGC TGGGCGTGGG CGTCACCCGC 360 
TACCTGGCCA TCGCGCACCA CCGCTTCTAT GCAGAGCGCC TGGCCGGCTG GCCGTGCGCC 420 
GCCATGCTGG TGtGCGCCGC CTGGGCGCTG GCGCTGGCCG CGGCCTTCCC GCCAGTGCTG 4 80 
GACGGCGGTG GCGACGACGA GGACGCGCCG TGCGCCCTGG AGCAGCGGCC CGACGGCGCC 54 0 
5 CCCGGCGCGC TGGGCtTCCT GCTGCTGCTG GCCGTGGTGG TGGGCGCCAC GCACCTCGTC 600 
TACCTCCGCC TGCTCT^CTT CATCCACGAC CGC CGCAAGA TGCGGCCCGC GCGCCTGGTG 660 
CCCGCCGTCA GCCACGACTG GACCTTCCAC GGCCCGGGCG CCACCGGCCA GGCGGCCGCC 72 0 
AACTGGACGG CGGGCTTCGG CCGCGGGCCC ACGCCGCCCG CGCTTGTGGG CATCCGGCCC 78 0 
GCAGGGCCGG GCCGCGGCGC GCGCCGCCTC CTCGTGCTGG AAGAATTCAA GACGGAGAAG 84 0 

1 0 AGGCTGTGCA AGATGTTCTA OGCCGTCACG CTGCTCTTCC TGCTCCTCTG GGGGCCCTAC 900 
GTCGTGGCCA GCTACCTGCG GGTCCTGGTG CGGCCCGGCG CCGTCCCCCA GGC CT ACCTG 960 
ACGGCCTCCG TGTGGCTGAC CTTCGCGCAG GCCGGCATCA ACCCCGTCGT GTGCTTCCTC1020 
TTCAACAGGG AGCTGAGGGA CTGCTTCAGG GCCCAGTTCC CCTGCTGCCA GAGCCCCCGG1 0 8 0 
ACCACCCAGG CGACCCATCC CTGCGAC CTG AAAGGCATTG GTTTATGA 112 8 

15 (17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 75 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

20 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Asn Ala Ser Glu Pro Gly Gly Ser Gly Gly Gly Glu Ala Ala 
! 5 10 15 

25 Ala Leu Gly Leu Lys Leu Ala Thr teu Ser Leu Leu Leu Cys Val Ser 

20 25 30 

Leu Ala Gly Asn Val Leu Phe Ala Leu Leu lie Val Arg Glu Arg Ser 
3S 40 45 

Leu His Arg Ala Pro Tyr Tyr Leu Leu Leu Asp Leu Cys Leu Ala Asp 
30 50 55 60 

Gly Leu Arg Ala Leu Ala Cys Leu Pro Ala Val Met Leu Ala Ala Arg 
65 70 75 80 
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Arg Ala Ala Ala Ala Ala Gly Ala Pro Pro Gly Ala Leu Gly Cys Lys 
85 90 95 

Leu Leu Ala Phe Leu Ala Ala Leu Phe Cys Phe His Ala Ala Phe Leu 
100 105 110 

5 Leu Leu Gly Val Gly Val Thr Arg Tyr Leu Ala He Ala His His Arg 

115 120 125 

Phe Tyr Ala Glu Arg Leu Ala Gly Trp Pro Cys Ala Ala Met Leu Val 
130 135 140 

Cys Ala Ala Trp Ala Lev Ala Leu Ala Ala Ala Phe Pro Pro Val Leu 
10 145 150 155 160 

Asp Gly Gly Gly Asp Asp Qlu Asp Ala Pro Cys Ala Leu Glu Gin Arg 
165 170 175 

Pro Asp Gly Ala Pro Gly Al* Leu Gly Phe Leu Leu Leu Leu Ala Val 
180 185 190 

15 Val Val Gly Ala Thr His Leu Val Tyr Leu Arg Leu Leu Phe Phe He 

195 200 205 

His Asp Arg Arg Lys Met Arg Pro Ala Arg Leu Val Pro Ala Val Ser 
210 215 220 

His Asp Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin Ala Ala Ala 
20 225 230 235 240 

Asn Trp Thr Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Ala Leu Val 
245 250 255 

Gly He Arg Pro Ala Gly Pro Gly Arg Gly Ala Arg Arg Leu Leu Val 
260 265 270 

25 Leu Glu Glu Phe Lys Thr Glu Lys Arg Leu Cys Lys Met Phe Tyr Ala 

275 280 285 

Val Thr Leu Leu Phe Leu Leu Leu Trp Gly fro Tyr Val Val Ala Ser 
2 90 2 95 3 00 

Tyr Leu Arg Val Leu Val Arg Pro Gly Ala Val Pro Gin Ala Tyr Leu 
30 305 310 313 320 

Thr Ala Ser Val Trp Leu Thr Phe Ala Gin Ala Gly He Asn Pro Val 
325 330 335 

Val Cys Phe Leu Phe Asn Arg Glu Leu Arg Asp Cys Phe Arg Ala Gin 
340 345 350 

35 Phe Pro Cys Cys Gin Ser Pro Arg Thr Thr Gin Ala Thr His Pro Cys 

355 360 365 



Asp Leu Lys Gly He Gly Leu 



V 
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/ - 370 375 

(18) INFORMATION FOR SEQ ID NO: 17: 

/ (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1002 base pairs 
5/ (B) TYPE: nucleic acid 

/ (C) STRAND EDNESS : single 

I (D) TOPOLOGY: linear 

J (ii) MOLECULE TYPE: DNA (genomic) 

i 

: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

j 

lJO ATGAACACCA CAGTGATGCA AGGCTTCAAC AGATCTGAGC GGTGCCCCAG AGACACTCGG 60 

ATAGTACAGC TGGTATTCCC AGCCCTCTAC ACAGTGGTTT TCTTGACCGG CATCCTGCTG 12 0 

AATACTTTGG CTCTGTGGGT GTTTGTTCAC ATCCCCAGCT CCTCCACCTT CAT CAT CT AC 180 

CTCAAAAACA CTTTGGTGGC CGACTTGATA ATGACACTCA TGCTTCCTTT CAAAATCCTC 240 

TCTGACTCAC ACCTGGCACC CTGGCAGCTC AGAGCTTTTG TGTGTCGTTT TTCTTCGGTG 3 00 

1 5 ATATTTTATG AGACCATGTA TGTGGGCATC GTGCTGTTAG GGCTCATAGC CTTTGACAGA 3 60 

TTCCTCAAGA TCATCAGACC TTTGAGAAAT ATTTTTCTAA AAAAACCTGT TTTTGCAAAA 42 0 

ACGGTCTCAA TCTTCATCTG GTTCTTTTTG TTCTTCATCT CCCTGCCAAA TACGATCTTG 480 

AGCAACAAGG AAGCAACACC ATCGTCTGTG AAAAAGTGTG CTTCCTTAAA GGGGCCTCTG 54 0 

GGGCTGAAAT GGCATCAAAT GGTAAATAAC ATATGCCAGT TTATTTTCTG GACTGTTTTT 600 

20 ATCCTAATGC TTGTGTTTTA TGTGGTTATT GCAAAAAAAG T AT ATGATT C TTATAGAAAG 660 

TCCAAAAGTA AGGACAGAAA AAACAACAAA AAGCTGGAAG GCAAAGTATT TGTTGTCGTG 72 0 

GCTGTCTTCT TTGTGTGTTT TGCTCCATTT CATTTTGCCA GAG TT C CAT A TACTCACAGT 780 

CAAACCAACA ATAAGACTGA CTGTAGACTG CAAAATCAAC TGTTTATTGC TAAAGAAACA 840 

ACTCTCTTTT TGGCAGCAAC TAACATTTGT ATGGATCCCT TAATATACAT ATTCTTATGT 900 

25 AAAAAATTCA CAGAAAAGCT ACCATGTATG CAAGGGAGAA AGAC CACAGC ATCAAGC CAA 960 

GAAAATCATA GCAGTCAGAC AGACAACATA ACCTTAGGCT GA 1002 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 
30 (B) TYPE: amino acid 

(C) STRAND EDNESS : 
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(D) TOPOLOGY: not relevant 
(ii) MOLECUI^E TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Met Asn Thr Thr Val Met Gin Gly Phe Asn Arg Ser Glu Arg Cys Pro 
5 1 5 10 15 

Ax-g Asp Thr Arg lie Val Gin Leu Val Phe Pro Ala Leu Tyr Thr Val 
20 25 30 

Val Phe Leu Thr Gly He Leu Leu Asn Thr Leu Ala Leu Trp Val Phe 
35 40 45 

10 Val His He Pro Ser Ser Ser Thr Phe He He Tyr Leu Lys Asn Thr 

50 55 60 

Leu Val Ala Asp Leu He Met Thr Leu Met Leu Pro Phe Lys He Leu 

65 70 75 80 

Ser Asp Ser His Leu Ala Pro Trp Gin Leu Arg Ala Phe Val Cys Arg 
15 85 90 95 

Phe Ser Ser Val He Phe Tyr Glu Thr Met Tyr Val Gly He Val Leu 
100 105 110 

Leu Gly Leu He Ala Phe Asp Arg Phe Leu Lys He He Arg Pro Leu 
115 120 125 

20 Arg Asn He Phe Leu Lys Lys Pro Val Phe Ala Lys Thr Val Ser He 

130 135 140 

Phe He Trp Phe Phe Leu Phe Phe He Ser Leu Pro Asn Thr He Leu 
145 150 155 160 

Ser Asn Lya Glu Ala Thr Pro Ser Ser Val Lys Lys Cys Ala Ser Leu 
25 165 170 175 

Lys Gly Pro Leu Gly Leu Lys Trp His Gin Met Val Asn Asn He Cys 
180 185 190 

Gin Phe He Phe Trp Thr Val Phe He Leu Met Leu Val Phe Tyr Val 

195 200 205 

30 Val He Ala Lys Lys Val Tyr Asp Ser Tyr Arg Lys Ser Lys Ser Lys 

210 215 220 

Asp Arg Lys Asn Asn Lys Lys Leu Glu Gly Lys Val Phe Val Val Val 
225 230 235 240 

Ala Val Phe Phe Val Cys Phe Ala Pro Phe His Phe Ala Arg Val Pro 
35 245 250 255 
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f. 

Tyr Thr Has Ser Gin Thr Asn Asn Lys Thr Asp Cys Arg Leu Gin Asn 
260 265 270 

Gin Leu Phe lie Ala Lys Glu Thr Thr Leu Phe Leu Ala Ala Thr Asn 
275 280 285 

5 lie Cys Met Asp Pro Leu lie Tyr lie Phe Leu Cys Lys Lys Phe Thr 

290 295 300 

Glu Lys Leu Pro Cys Met Gin Gly Arg Lys Thr Thr Ala Ser Ser Gin 
305 310 315 320 

Glu Asn His Ser Ser Gin Thr Asp Asn lie Thr Leu Gly 
10 325 330 

(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1122 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATGGCCAACA CTACCGGAGA GCCTGAGGAG GTGAGCGGCG CTCTGTCCCC ACCGTCCGCA 60 

20 TCAGCTTATG TGAAGCTGGT ACTGCTGGGA CTGATTATGT GCGTGAGCCT GGCGGGTAAC 12 0 
GCCATCTTGT CCCTGCTGGT GCTCAAGGAG CGTGCCCTGC ACAAGGCTCC TTACTACTTC 180 
CTGCTGGACC TGTGCCTGGC CGATGGCATA CGCTCTGCCG TCTGCTTCCC CTTTGTGCTG 24 0 
GCTTCTGTGC GCCACGGCTC TTCATGGACC TTCAGTGCAC TCAGCTGCAA GATTGTGGCC 3 00 
TTTATGGCCa TGerCTTTTG CTTCCATGCG GCCTTCATGC TGTTCTGCAT CAGCGTCACC 360 

25 CGCTACATGG CCATCGCCCA CCACCGCTTC TACGCCAAGC GCATGACACT CTGGACATGC 420 
GCGGCTGTCA TCTGCATGGC CTGGACCCTG TCTGTGGC CA TGGCCTTCCC ACCTGTCTTT 4 80 
GACGTGGGCA CCTACAAGTT TATTCGGGAG GAGGACCAGT G CAT CTTTG A GCATCGCTAC 54 0 
TTCAAGGCCA ATGACACGCT GGGCTTCATG CTTATGTTGG CTGTGCTCAT GGCAGCTACC 600 
CATGCTGTCT ACGGCAAGCT GCTCCTCTTC GAGTATCGTC ACCGCAAGAT GAAGC CAGTG 66 0 

30 CAGATGGTGC CAGC CATCAG CCAGAACTGG AC ATT C CATG GTCCCGGGGC CACCGGCCAG 72 0 
GCTGCTGCCA ACTGGATCGC CGGCTTTGGC CGTGGGCCCA TGCCACCAAC CCTGCTGGGT 7 80 
ATCCGGCAGA ATGGGCATGC AGCCAGCCGG CGGCTACTGG GCATGGACGA GGTCAAGGGT 84 0 
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GAAAAGCAGC TGGGCCGCAT GTTCTACGCG ATCACACTGC TCTTTCTGCT CCTCTGGTCA 900 
CCCTACATCG TGGCCTGCTA CTGGCGAGTG TTTGTGAAAG CCTGTGCTGT GCCCCACCGC 960 
TACCTGGCCA CTGCTGTTTG GATGAGCTTC GCCCAGGCTG CCGTCAACCC AATTGTCTGC 102 0 
TTCCTGCTCA ACAAGGACCT CAAGAAGTGC CTGACCACTC ACGCCCCCTG CTGGGG C AC A 1 0 8 0 
5 GGAGGTGCCC CGGCTCCCAG AGAACCCTAC TGTGTCATGT GA 1122 
(21) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 
10 (C) STRANDEDNES3 : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ala Asn Thr Thr Gly Glu Pro Glu Glu Val Ser Gly Ala Leu Ser 
15 1 5 10 15 

Pro Pro Ser Ala Ser Ala Tyr Val Lys Leu Val Leu Leu Gly Leu lie 
2 0 25 3 0 

Met Cys Val Ser Leu Ala Gly Asn Ala lie Leu Ser Leu Leu Val Leu 
3 5 4 0 45 

20 Lys Glu Arg Ala Leu His Lys Ala Pro Tyr Tyr Phe Leu Leu Asp Leu 

50 55 60 

Cys Leu Ala Asp Gly lie Arg Ser Ala Val Cys Phe Pro Phe Val Leu 

65 70 75 80 

Aim S*r Val Arg His Gly Ser Ser Trp Thr Phe Ser Ala Leu Ser Cys 

25 85 90 95 

~ Ly* He Val Ala Phe Met Ala Val Leu Phe Cys Phe His Ala Ala Phe 

100 105 110 

Met Leu Phe Cys He Ser Val Thr Arg Tyr Met Ala He Ala His His 
115 120 125 

30 Arg Phe Tyr Ala Lys Arg Met Thr Leu Trp Thr Cys Ala Ala Val He 

130 135 140 

Cys Met Ala Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Phe 

145 150 155 160 

Asp Val Gly Thr Tyr Lys Phe He Arg Glu Glu Asp Gin Cys He Phe 
35 165 < 170 175 
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j. f " % Glu His Arg Tyr Phe Lys Ala Asn Asp Thr Leu Gly Phe Met Leu Met 

180 185 190 

Leu Ala Val Leu Met Ala Ala Thr His Ala Val Tyr Gly Lys Leu Leu 
195 200 205 

j5 Leu Phe Glu Tyr Arg His Arg Lys Met Lys Pro Val Gin Met Val Pro 

210 215 220 

; Ala lie Ser Gin Asn Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin 

225 230 235 240 

Ala Ala Ala Asn Trp lie Ala Gly Phe Gly Arg Gly Pro Met Pro Pro 
ID 245 250 255 

Thr Leu Leu Gly lie Arg Gin Asn Gly His Ala Ala Ser Arg Arg Leu 
260 265 270 

Leu Gly Met Asp Glu Val Lys Gly Glu Lys Gin Leu Gly Arg Met Phe 
275 280 285 

15 Tyr Ala lie Thr Leu Leu Phe Leu Leu Leu Trp Ser Pro Tyr lie Val " 

290 295 300 

Ala Cys Tyr Trp Arg Val Phe Val Lys Ala Cys Ala Val Pro His Arg 

305 310 315 320 

Tyr Leu Ala Thr Ala Val Trp Met Ser Phe Ala Gin Ala Ala Val Asn 
20 325 330 335 

Pro lie Val Cys Phe Leu Leu Asn Lys Asp Leu Lys Lys Cys Leu Thr 
340 345 350 

Thr His Ala Pro Cys Trp Gly Thr Gly Gly Ala Pro Ala Pro Arg Glu 
355 360 365 

25 Pro Tyr Cys Val Met 

370 

(22) INFORMATION FOR SEQ ID NO: 21: 

U) SEQUENCB CHARACTERISTICS : 

(A) LENGTH: 1053 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 21: 
35 ATGGCTTTGG AACAGAACCA GTCAACAGAT TATTATTATG AGGAAAATGA AATGAATGGC 60 
ACTTATGACT ACAGTCAATA TGAATTGATC TGTATCAAAG AAGATGTCAG AGAATTTGCA 120 
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AAAGTTTTCC T CC CTGTATT CCTCACAATA GCTTTCGTCA TTGGACTTGC AGGCAATTCC 180 
ATGGTAGTGG CAATTTATGC CTATTACAAG AAACAGAGAA CCAAAACAGA TGTGTACATC 24 0 
CTGAATTTGG CTGTAGCAGA TTTACTCCTT CTATTCACTC TGC CTTTTTG GGCTGTTAAT 3 00 

( 

GCAGTTCATG GGTGGGTTTT AGGGAAAATA ATGTGCAAAA TAACTTCAGC CTTGTACACA 3 60 
5 CTAAACTTTG TCTCTGGAAT GCAGTTTCTG GCTTGCATCA GCATAGACAG ATATGTGGCA 42 0 
j GTAACTAATG TCCCCAGCCA ATCAGGAGTG GGAAAACCAT GCTGGATCAT CTGTTTCTGT 4 80 
| GTCTGGATGG CTGCCATCTT GCTGAGCATA CCCCAGCTGG TTTTTTATAC AGTAAATGAC 54 0 
; AATGCTAGGT GCATTC C CAT TTTCCCCCGC TACCTAGGAA CATCAATGAA AGCATTGATT 600 
CAAATGCTAG AGATCTGCAT TGGATTTGTA GTACCCTTTC TTATTATGGG GGTGTGCTAC 660 
10 TTTATCACGG CAAGGACACT CATGAAGATG CCAAACATTA AAATATCTCG ACCCCTAAAA 72 0 
GTTCTGCTCA CAGTCGTTAT AGTTTTCATT GTCACTCAAC TGCCTTATAA CATTGTCAAG 780 

i 

TTCTGCCGAG CCATAGACAT CATCTACTCC CTGATCACCA GCTGCAACAT GAGCAAACGC 84 0 

ATGGACATCG CCATCCAAGT CACAGAAAGC ATTGCACTCT TTCACAGCTG CCTCAACCCA 900 

ATCCTTTATG TTTTTATGGG AGCATCTTTC AAAAACTACG TTATGAAAGT GGCCAAGAAA 960 

1 5 TATGGGTCCT GGAGAAGACA GAGACAAAGT GTGGAGGAGT TTCCTTTTGA TTCTGAGGGT 102 0 

CCTACAGAGC CAAC CAGTAC TTTTAGCATT TAA 1053 

(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 
20 (B) TYPE : amino acid 

(C) STRANDEDNESS : 
(DV- TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

25 Met Ala Leu Glu Gin Asn Gin Ser Thr Asp Tyr Tyr Tyr Glu Glu Asn 

15 10 15 

Glu Met Asn Gly Thr Tyr Asp Tyr Ser Gin Tyr Glu Leu lie Cys lie 
20 25 30 

Lys Glu Asp Val Arg Glu Phe Ala Lys Val Phe Leu Pro Val Phe Leu 
30 35 40 45 

Thr lie Ala Phe Val He Gly Leu Ala Gly Asn Ser Met Val Val Ala 



, ..hV 
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56 



55 



60 



He 
65 

Leu 



Trp 
Lys 



Tyr Ala Tyr Tyr 



Lys Lys 
70 



Asn LeU Ala Val Ala Asp 
85 

Ala Val A*m Ala Val His 
100 

He Thr Ser Ala Leu Tyr 

115 



Gin Arg Thr 



Leu Leu Leu 
90 

Gly Trp Val 
105 

Thr Leu Asn 
120 



Lys Thr Asp Val 
75 

Leu Phe Thr Leu 



Tyr He 
80 

Pro Phe 
95 



Leu Gly Lys He Met Cys 
110 

Phe Val Ser Gly Met Gin 
125 



10 



Phe 



Pro 
145 



Leu Ala Cys lie 

130 

Ser Gin Ser Gly 



Ser He 

135 

Val Gly 
150 



Asp Arg Tyr 
Lys Pro Cys 



Val Ala Val Thr 
140 

Trp He He Cys 
155 



Asn Val 



Phe Cys 

160 



15 



Val Trp Met Ala Ala 
165 



He Leu 



Leu Ser He 

170 



Pro Gin Leu Val 



Phe Tyr 
175 



Thr 



Gly 



Val Asn Asp Asn Ala Arg 
180 



Thr Ser Met Lys 
195 



Ala Leu 



Cys He Pro He Phe Pro Arg Tyr Leu 
185 190 

He Gin Met Leu Glu He Cys He Gly 
200 205 



20 



25 



30 



Phe 



Arg 
225 

Val 



Asn 



Val Val Pro Phe 

210 

Thr Leu Met Lys 



Leu He 

215 

Met Pro 

230 



Leu Leu Thr Val Val He 
245 



Thr 



Glu 



Phe 

305 

Tyr 



He Val Lys Phe 

260 

Ser Cys Asn Met 

275 

Ser He Ala Leu 

290 

Met Gly Ala Ser 



Cys Arg 
Ser Lys 



Phe His 
295 

Phe Lys 

310 



35 



Gly Ser Trp Arg Arg Gin 
325 



Met Gly Val 
Asn He Lys 

Val Phe He 

250 

Ala He Asp 

265 

Arg Met Asp 
280 

Ser Cys Leu 
Asn Tyr Val 



Arg Gin Ser 
330 



Cys Tyr Phe He Thr Ala 
220 



He Ser Arg Pro 

235 

Val Thr Gin Leu 



He He Tyr Ser 

270 



Leu Lys 
240 

Pro Tyr 
255 

Leu He 



He Ala He Gin Val Thr 
285 

Asn Pro He Leu Tyr Val 
300 



Met Lys Val Ala 
315 

val Glu Glu Phe 



Lys Lys 
320 

Pro Phe 
335 



Asp Ser Glu Gly Pro Thr Glu Pro Thr Ser Thr Phe Ser He 
340 345 350 
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(24) INFORMATION FOR SEQ ID NO: 23: 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 base pairs 

(B) TYPE: nucleic acid 

(C) StRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



ATGCCAGGAA ACGCCACCCC AGTGACCACC ACTGCCCCGT GGGCCTCCCT GGGCCTCTCC 60 

10 GCCAAGACCT GCAACAACGT GTCCTTCGAA GAGAGCAGGA TAGTC CTGGT CGTGGTGTAC 120 
AGCGCGGTGT GCACGCTGGO GGTGC CGGCC AACTGCCTGA CTGCGTGGCT GGCGCTGCTG 180 
CAGGTACTGC AGGGCAACGT GCTGGCCGTC T AC CTGCTCT GCCTGGCACT CTGCGAACTG 24 0 
CTGTACACAG GCACGCTGCC ACTCTGGGTC ATCTATATCC GCAACCAGCA CCGCTGGACC 3 00 
CTAGGCCTGC TGGCCTCGAA GGTGACCGCC TACATCTTCT TCTGCAACAT CTACGTCAGC 3 60 

1 5 ATCCTCTTCC TGTGCTGCAT CTCCTGCGAC CGCTTCGTGG CCGTGGTGTA CGCGCTGGAG 42 0 
AGTCGGGGCC GCCGCCGCCG GAGGACCGCC ATCCTCATCT CCGCCTGCAT CTTCATCCTC 4 80 
GTCGGGATCG TTCACTACCC GGTGTTCCAG ACGGAAGACA AGGAGACCTG CTTTGACATG 54 0 
CTGCAGATGG ACAGCAGGAT TGCCGGGTAC TACTACGCCA GGTTCAC CGT TGGCTTTGCC 600 
ATCCCTCTCT CCATCATCGC CTTCACCAAC CACCGGATTT TCAGGAGCAT CAAGCAGAGC 660 

20 ATGGGCTTAA GCGCTGCCCA GAAGGCCAAG GTGAAGCACT CGGCCATCGC GGTGGTTGTC 72 0 
AT CTTCCT AG TCTGCTTCGC CCCGTACCAC CTGGTTCTCC TCGTCAAAGC CGCTGCCTTT 7 80 
TCCTACTACA GAGGAGACAG GAACGCCATG TGCGGCTTGG AGGAAAGGCT GTACACAGCC 84 0 
TCTQTGGTGT TTCTGTGCCT GTCCACGGTG AACGGCGTGG CTGACCCCAT TATCTACGTG 900 
CTGGCCACGG ACCATTCCCG CCAAGAAGTG TCCAGAATCC ATAAGGGGTG GAAAGAGTGG 960 

25 TCCATGAAGA CAGACGTCAC CAGGCTCACC CACAGCAGGG ACACCGAGGA GCTGCAGTCG1 0 2 0 
CCCGTGGCCC TTGCAGACCA CTACACCTTC TCCAGGCCCG TGCACCCACC AGGGTCACCA1 0 8 0 
TGCCCTGCAA AGAGGCTGAT TGAGGAGTCC TGCTGA 1116 
(25) INFORMATION FOR SEQ ID NO: 24: 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Pro Gly Asn Ala Thr Pro Val Thr Thr Thr Ala Pro Trp Ala Ser 
15 10 15 

Leu Gly Leu Ser Ala Lys Thr Cys Asn Asn Val Ser Phe Glu Glu Ser 
20 25 30 

10 Arg lie Val Leu Val Val Val Tyr Ser Ala Val Cys Thr Leu Gly Val 

35 40 45 



Pro Ala Asn Cys Leu Thr Ala Trp Leu Ala Leu Leu Gin Val Leu Gin 
50 55 60 

Gly Asn Val Leu Ala Val Tyr Leu Leu Cys Leu Ala Leu Cys G lu Leu 
15 65 70 75 80 

Leu Tyr Thr Gly Thr Leu Pro Leu Trp Val lie Tyr lie Arg Asn Gin 
85 90 95 

His Arg Trp Thr Leu Gly Leu Leu Ala Ser Lys Val Thr Ala Tyr lie 
100 105 110 

20 Phe Phe Cys Asn lie Tyr Val Ser lie Leu Phe Leu Cys Cys He Ser 

115 120 125 

Cys Asp Arg Phe Val Ala Val Val Tyr Ala Leu Glu Ser Arg Gly Arg 
130 135 140 

Arg Arg Arg Arg Thr Ala He Leu He Ser Ala Cys He Phe He Leu 
25 145 ISO 155 160 

Val Gly He Val His Tyr Pro Val Phe Gin Thr Glu Asp Lys Glu Thr 
165 170 175 

Cys Phe Asp Met Leu Gin Met Asp Ser Arg He Ala Gly Tyr Tyr Tyr 
180 185 190 

30 Ala Arg Phe Thr Val Gly Phe Ala He Pro Leu Ser He He Ala Phe 

195 200 205 

Thr Asn His Arg He Phe Arg Ser He Lys Gin Ser Met Gly Leu Ser 
210 215 220 

Ala Ala Gin Lys Ala Lys Val Lys His Ser Ala He Ala Val Val Val 
35 225 230 235 240 
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He Phe Leu Val Cys Phe Ala Pro Tyr His Leu Val Leu Leu Val Lys 
245 250 255 

Ala Ala Ala Phe Ser Tyr Tyr Arg Gly Asp Arg Asn Ala Met Cys Gly 
260 265 270 

5 Leu Glu Glu Arg Leu Tyr Thr Ala Ser Val Val Phe Leu Cys Leu Ser 

275 280 285 

Thr Val Asn Gly Val Ala Asp Pro He He Tyr Val Leu Ala Thr Asp 
290 295 300 

His Ser Arg Gin Glu Val Ser Arg He His Lys Gly Trp Lys Glu Trp 
10 305 310 315 320 

Ser Met Lys Thr Asp Val Thr Arg Leu Thr His Ser Arg Asp Thr Glu 
325 330 335 

, Glu Leu Gin Ser Pro Val Ala Leu Ala Asp His Tyr Thr Phe Ser Arg 

i 340 345 350 

15 | Pro Val His Pro Pro Gly Ser Pro Cys Pro Ala Lys Arg Leu He Glu 

355 360 365 

J Glu Ser Cys 
370 

;(26) INFORMATION FOR SEQ ID NO: 25: 

20 1 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 base pairs 

(B) TYPE: nucleic acid 

; (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ATGGCGAACT ATAGCCATGC AGCTGACAAC ATTTTGCAAA ATCTCTCGCC TCTAACAGCC 60 
TTTCTGAAAC TGACTTCCTT GGGTTTCATA ATAGGAGTCA GCGTGGTGGG CAACCTCCTG 12 0 
ATCTCCATTT TGCTAGTGAA AGATAAGACC TTGCATAGAG CACCTTACTA CTTCCTGTTG 180 
30 GATCTTTGCT GTTCAGATAT CCTCAGATCT GCAATTTGTT TCCCATTTGT GTTCAACTCT 24 0 
GTCAAAAATG GCTCTACCTG GACTTATGGG ACTCTGACTT GCAAAGTGAT TGCCTTTCTG 3 00 
GGGGTTTTGT CCTGTTTCCA CACTGCTTTC ATGCTCTTCT GCATCAGTGT C AC C AG AT AC 3 60 
TTAGCTATCG CCCATCACCG CTTCTATACA AAGAGGCTGA CCTTTTGGAC GTGTCTGGCT 42 0 
GTGATCTGTA TGGTGTGGAC TCTGTCTGTG GC CATGGCAT TTCCCCCGGT TTTAGACGTG 4 80 
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GGCACTTACT CATTCATTAG GGAGGAAGAT CAATGCACCT TCCAACAC CG CTCCTTCAGG 540 

GCTAATGATT C CTTAGG ATT TATGCTGCTT CTTGCTCTCA TCCTCCTAGC CACACAGCTT 60 0 

GTCTACCTCA AGCTGATATT TTTCGTCCAC GATCGAAGAA AAATGAAGCC AGTCCAGTTT 660 

GTAGCAGCAG TCAGCCAGAA CTGGACTTTT CATGGTCCTG GAGCCAGTGG CCAGGCAGCT 72 0 

(5 GCCAATTGGC TAGCAGGATT TGGAAGGGGT CCCACACCAC CCACCTTGCT GGGCATCAGG 7 80 

CAAAATGCAA ACAC CACAGG CAGAAGAAGG CTATTGGTCT TAGACGAGTT CAAAATGGAG 84 0 

AAAAGAATCA GCAGAATGTT CTATATAATG ACTTTTCTGT TTCTAACCTT GTGGGGCCCC 900 

TACCTGGTGG CCTGTTATTG GAGAGTTTTT GCAAGAGGGC CTGTAGT AC C AGGGGGATTT 960 

CTAACAGCTG CTGTCTGGAT GAGTTTTGCC CAAGCAGGAA TCAATCCTTT TGTCTGCATT 102 0 

10 TTCTCAAACA GGGAGCTGAG GCGCTGTTTC AGCACAACCC TTCTTTACTG CAGAAAATCC 10 80 

AGGTTACCAA GGGAACCTTA CTGTGTTATA TGA 1113r 

(27) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 70 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

20 Met Ala Asn Tyr Ser His Ala Ala Asp Asn lie Leu Gin Asn Leu Ser 

15 10 15 

Pro Leu Thr Ala Phe Leu Lys Leu Thr Ser Leu Gly Phe He He Gly 

20 25 30 

val Ser Val Val Gly Asn Leu Leu He Ser He Leu Leu Val Lys Asp 
25 35 40 45 

Lys Thr Leu His Arg Ala Pro Tyr Tyr Phe Leu Leu Asp Leu Cys Cys 
50 55 60 

Ser Asp He Leu Arg Ser Ala He Cys Phe Pro Phe Val Phe Asn Ser 

65 70 75 80 

30 Val Lys Asn Gly Ser Thr Trp Thr Tyr Gly Thr Leu Thr Cys Lys Val 

85 90 95 

He Ala Phe Leu Gly Val Leu Ser Cys Phe His Thr Ala Phe Met Leu 
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100 105 110 

phe Cys lie Ser Val Thr Arg Tyr Leu Ala lie Ala His His Arg Phe 
115 12 0 12 5 

/ Tyr Thr Lys Arg Leu Thr Phe Trp Thr Cys Leu Ala Val lie Cys Met 
5 130 135 140 

Val Trp Thr Leu Ser Val Ala Met Ala Phe Pro Pro Val Leu Asp val 
145 150 155 160 

Gly Thr Tyr Ser Phe lie Arg Glu Glu Asp Gin Cys Thr Phe Gin His 
165 170 175 

10 Arg Ser Phe Arg Ala Asn Asp Ser Leu Gly Phe Met Leu Leu Leu Ala 

180 185 190 

Leu lie Leu Leu Ala Thr Gin Leu Val Tyr Leu Lys Leu lie Phe Phe 
195 200 205 

Val His Asp Arg Arg Lys Met Lys Pro Val Gin Phe Val Ala Ala Val 
15 210 215 220 

Ser Gin Asn Trp Thr Phe His Gly Pro Gly Ala Ser Gly Gin Ala Ala 
225 230 235 240 

Ala Asn Trp Leu Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Thr Leu 
245 250 255 

20 Leu Gly lie Arg Gin Asn Ala Asn Thr Thr Gly Arg Arg Arg Leu Leu 

260 265 270 

Val Leu Asp Glu Phe Lys Met Glu Lys Arg lie Ser Arg Met Phe Tyr 
275 280 285 

lie Met Thr Phe Leu Phe Leu Thr Leu Trp Gly Pro Tyr Leu Val Ala 
25 290 295 300 

Cys Tyr Trp Arg Val Phe Ala Arg Gly Pro Val Val Pro Gly Gly Phe 
305 310 315 320 

Leu Thr Ala Ala Val Trp Met Ser Phe Ala Gin Ala Gly lie Asn Pro 
325 330 335 

30 Phe Val Cys He Phe Ser Asn Arg Glu Leu Arg Arg Cys Phe Ser Thr 

340 345 350 

Thr Leu Leu Tyr Cys Arg Lys Ser Arg Leu Pro Arg Glu Pro Tyr Cys 
355 360 365 

Val He 

35 370 

(28) INFORMATION FOR SEQ ID NO: 27: 
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(i) Sequence characteristics: 

(A) LENGTH: 1080 base pairs 
(&) TYPE: nucleic acid 
(CK STRANDEDNESS : single 
5 (D) \TOPOLOGY: linear 

\ 

(ii) MOLECUtE TYPE: DNA (genomic) 

\ 

(xi) SEQUENCE ^DESCRIPTION: SEQ ID NO: 27: 

\ 

ATGCAGGTCC CGAACAGCAC CGGCCCGGAC AACGCGACGC TGCAGATGCT GCGGAACCCG 6 0 
GCGATCGCGG TGGCCCTGCC\ CGTGGTGTAC TCGCTGGTGG CGGCGGTCAG CATCCCGGGC 12 0 

10 AACCTCTTCT CTCTGTGGGT GCTGTGC CGG CGCATGGGGC CCAGATCCCC GTCGGTCATC 180 
TTCATGATCA ACCTGAGCGT CACGGACCTG ATGCTGGCCA GCGTGTTGCC TTTCCAAATC 240 
TACT AC CATT GCAACCGCCA C CACTGGGT A TTCGGGGTGC TGCTTTGCAA CGTGGTGACC 3 00 
GTGGCCTTTT ACGCAAACAT GTATTCCAGC ATCCTCACCA TGAC CTGTAT CAGCGTGGAG 3 6<J 
CGCTTCCTGG GGGTCCTGTA CCCGCTCAGC TCCAAGCGCT GGCGCCGCCG TCGTTACGCG 420 

15 GTGGCCGCGT GTGCAGGGAC CTGGCTGOTG CTCCTGACCG CCCTGTGCCC GCTGGCGCGC 4 80 
ACCGATCTCA CCTACCCGGT GCACGCCCTG GGCATCATCA CCTGCTTCGA CGTCCTCAAG 54 0 
TGGACGATGC TCCCCAGCGT GGC CATGTGG GCCGTGTTCC TCTTCACCAT CTTCATC CTG 600 
CTGTTCCTCA TCCCGTTCGT GATCACCGTG GCTTGTTACA CGG CC AC CAT CCTCAAGCTG 660 
TTGCGCACGG AGGAGGCGCA CGGCCGGGAG CAGCGGAGGC GCGCGGTGGG CCTGGCCGCG 72 0 

20 GTGGTCTTGC TGGCCTTTGT CACCTGCTTC GCCCCCAACA ACTTCGTGCT CCTGGCGCAC 780 
ATCGTGAGCC GCCTGTTCTA CGGCAAGAGC TACTACCACG TGTACAAGCT CACGCTGTGT 840 
CTCAOCTGCC TCAACAACTG TCTGGACC CG TTTGTTTATT ACTTTGCGTC CCGGGAATTC 900 
CAGCTGCGCC TGCGGGAATA TTTGGGCTGC CGCCGGGTGC CCAGAGACAC C CTGGACACG 960 
CGCCGCGAGA GCCTCTTCTC CGCCAGGACC ACGTCCGTGC GCTCCGAGGC CGGTGCGCAC 102 0 

25 CCTGAAGGGA TGGAGGGAGC CACCAGGCCC GGCCTCCAGA GGCAGGAGAG TGTGTTCTGA1 0 80 

(2 9) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 9 amino acids 

(B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gin Val Pro Asn Ser Thr Gly Pro Asp Asn Ala Thr Leu Gin Met 
15 10 15 

5 Leu Arg Asn Pro Ala lie Ala Val Ala Leu Pro Val Val Tyr Ser Leu 

20 25 30 

Val Ala Ala Val Ser lie Pro Gly Asn Leu Phe Ser Leu Trp Val Leu 
35 40 45 

Cys Arg Arg Met Gly Pro Arg Ser Pro Ser Val lie Phe Met lie Asn 
10 50 55 60 

Leu Ser Val Thr Asp Leu Met Leu Ala Ser Val Leu Pro Phe Gin lie 
65 70 75 80 

Tyr Tyr His Cys Asn Arg His His Trp Val Phe Gly Val Leu Leu Cys 
85 90 95 

15 Asn Val Val Thr Val Ala Phe Tyr Ala Asn Met Tyr Ser Ser lie Leu 

100 105 110 

Thr Met Thr Cys lie Ser Val Glu Arg Phe Leu Gly Val Leu Tyr Pro 
115 120 125 

Leu Ser Ser Lys Arg Trp Arg Arg Arg Arg Tyr Ala Val Ala Ala Cys 
20 130 135 140 

Ala Gly Thr Trp Leu Leu Leu Leu Thr Ala Leu Cys Pro Leu Ala Arg 
145 150 155 160 

Thr Asp Leu Thr Tyr Pro Val His Ala Leu Gly lie lie Thr Cys Phe 
165 170 175 

25 Asp Val Leu Lys Trp Thr Met Leu Pro Ser Val Ala Met Trp Ala Val 

180 185 190 

Phe Leu Phe Thr lie Phe lie Leu Leu Phe Leu lie Pro Phe Val lie 
195 200 205 

Thr Val Ala Cys Tyr Thr Ala Thr lie Leu Lys Leu Leu Arg Thr Glu 
30 210 215 220 

Glu Ala His Gly Arg Glu Gin Arg Arg Arg Ala Val Gly Leu Ala Ala 
225 230 235 240 

Val Val Leu Leu Ala Phe Val Thr Cys Phe Ala Pro Asn Asn Phe Val 
245 250 255 



35 Leu Leu Ala His lie Val Ser Arg Leu Phe Tyr Gly Lys Ser Tyr Tyr 

260 265 270 
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His Val Tyr Lys Leu Thr Leu Cys Leu Ser Cys Leu Asn Asn Cys Leu 
2*5 280 285 

Asp Pro Phe Val Tyr Tyr Phe Ala Ser Arg Glu Phe Gin Leu Arg Leu 
290 \ 295 300 

5 Arg Glu Tyr Leu\Gly Cys Arg Arg Val Pro Arg Asp Thr Leu Asp Thr 

305 \ 310 315 320 

\ 

Arg Arg Glu Ser Leu, Phe Ser Ala Arg Thr Thr Ser Val Arg Ser Glu 

325\ 330 335 

Ala Gly Ala His Pro Glu Gly Met Glu Gly Ala Thr Arg Pro Gly Leu 
10 340 345 350 

Gin Arg Gin Glu Ser Val P{ie 
355 

(30) INFORMATION FOR SEQ ID NO: 2*: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1503 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGGAGCGTC CCTGGGAGGA CAGCCCAGGC CCGGAGGGGG CAGCTGAGGG CTCGCCTGTG 6 0 
CCAGTCGCCG CCGGGGCGCG CTCCGGTGCC GCGGCGAGTG GCACAGGCTG GCAGCCATGG 12 0 
GCTGAGTGCC CGGGACCCAA GGGGAGGGGG CAACTGCTGG CGACCGCCGG CCCTTTGCGT 180 
CGCTGGCCCG CCCCCTCGCC TGCCAGCTCC AGCCCCGCCC CCGGAQCGGC GTCCGCTCAC 24 0 

25 TCGQTTCAAG GCAGCGCGAC TGCGGGTGGC GCACGACCAG GGCGCAGACC TTGGGGCGCG 3 00 
CGGCCCATGG AGTCGGGGCT GCTGCGGCCG GCGC CGGTGA GCGAGGTCAT CGTCCTGCAT 360 
TACAACTACA CCGGCAAGCT CCGCGGTGCG AGCTACCAGC CGGGTGCCGG CCTGCGCGCC 42 0 
GACGCCGTGG TGTGCCTGGC GGTGTGCGCC TTCATCGTGC TAGAGAATCT AGCCGTGTTG 4 80 
TTGGTGCTCG GACGCCACCC GCGCTTCCAC GCTCCCATGT TCCTGCTCCT GGGCAGCCTC 54 0 

30 ACGTTGTCGG ATCTGCTGGC AGGCGCCGCC TACGCCGCCA ACATCCTACT GTCGGGGCCG 60 0 
CTCACGCTGA AACTGTCCCC CGCGCTCTGG TTCGCACGGG AGGGAGGCGT CTTCGTGGCA 660 
CTCACTGCGT CCGTGCTGAG CCTCCTGGCC ATCGCGCTGG AGCGCAGCCT CACCATGGCG 72 0 
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CGCAGGGGGC CCGCGCCCGT CTCCAGTCGG GGGCGCACGC TGGCGATGGC AGCCGCGGCC 780 
TGGGGCGTGT CGCTGCTCCT CGGGCTCCTG CCAGCGCTGG GCTGGAATTG CCTGGGTCGC 84 0 
! CTGGACGCTT GCTCCACTGT CTTGCCGCTC TACGCCAAGG CCTACGTGCT CTTCTGCGTG 900 
\ CTCGCCTTCG TGGGC*TCCT\ GGCCGCGATC TGTGCACTCT ACGCGCGCAT CTACTGCCAG 960 
5 GTACGCGCCA ACGCGCC^CG CC^CGGCA CGGCCCGGGA CTGCGGGGAC CACCTCGACC1020 
; CGGGCGCGTC GCAAGCCGCG CT^CTGGCC TTGCTGCGCA CGCTCAGCGT GGTGCTCCTG1 0 8 0 
GCCTTTGTGG CATGTTGGGG CCCCCTCTTC CTGCTGCTGT TGCTCGACGT GGCGTGC C CG 1140 
GCGCGCACCT GTC CTGT ACT CCTGCAGGCC GATCCCTTCC TGGGACTGGC CATGGCCAAC 1200 
TCACTTCTGA ACCCCATCAT CTACACGCTC ACCAACCGCG ACCTGCGCCA CGCGCTCCTG1260 
10CGCCTGGTCT GCTGCGGACG CCACTCCTGC GGCAGAGACC CGAGTGGCTC CCAGCAGTCG1320 
GCGAGCGCGG CTGAGGCTTC CGGGGGCCTG CGCCGCTGCC TGCCCCCGGG CCTTGATGGG13 80 
AGCTTCAGCG GCTCGGAGCG CTCATCGCCC CAGCGCGACG GGCTGGACAC CAGCGGCTCC1440 
ACAGGCAGCC CCGGTGCACC CACAGCCGCC OGGACTCTGG TATCAGAACC GGCTGCAGAC 1500 

1503 

TGA 

15 (31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 00 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

20 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Glu Arg Pro Trp Glu Asp Ser Pro Gly Pro Glu Gly Ala Ala Glu 
15 10 15 

25 Gly Ser Pro Val Pro Val Ala Ala Gly Ala Arg Ser Gly Ala Ala Ala 

20 25 30 

Ser Gly Thr Gly Trp Gin Pro Trp Ala Glu Cye Pro Gly Pro Lys Gly 
35 40 45 

Arg Gly Gin Leu Leu Ala Thr Ala Gly Pro Leu Arg Arg Trp Pro Ala 
30 50 55 60 



Pro Ser Pro Ala Ser Ser Ser 



Pro Ala Pro Gly Ala Ala Ser Ala His 



65 70 75 80 



\ 
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h f ' ser val Gin Gly Ser Ala Thr Ala Gly Gly Ala Arg Pro Gly Arg Arg 

85 9° 95 



15 



Pro Trp Gly^Ala Arg Pro Met Glu Ser Gly Leu Leu Arg Pro Ala Pro 
^.00 



Val Se 

115 



105 HO 
r Glu val lie Val Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg 



120 125 



Gly Ala Ser TyrWn Pro Gly Ala Gly Leu Arg Ala Asp Ala Val Val 
130 \ 135 140 

\ 



Cys Leu Ala Val Cye Ala Phe He Val Leu Glu Aan Leu Ala Val Leu 
145 150 155 

Leu Val Leu Gly Arg V» Pro ^9 Phe Hia Ala Pr ° M<5t Ph ° ^ ^ 

1S5 \ 170 175 

Leu Gly Ser Leu Thr Lev Ser Asp Leu Leu Ala Gly Ala Ala Tyr Ala 
180 185 I" 

Ala Asn He Leu Leu Ser qiy Pro Leu Thr Leu Lys Leu Ser Pro Ala 



195 



200 205 



Leu Trp Phe Ala Arg Glu Gly, Gly Val Phe Val Ala Leu Thr Ala Ser 

210 215\ 220 



Val Leu Ser Leu Leu Ala He A^La Leu Glu Arg Ser Leu Thr Met Ala 
20 225 



230 235 240 



Arg Arg Gly Pro Ala Pro Val Ser Ser Arg Gly Arg Thr Leu Ala Met 
245 250 255 

i Ala Ala Ala Ala Trp Gly Val Ser Leu Leu Leu Gly Leu Leu Pro Ala 

260 265 270 

i 

25 Leu Gly Trp Asn Cys Leu Gly Arg Leu Asp Ala Cys Ser Thr Val Leu 

275 280 285 

Pro Leu Tyr Ala Lys Ala Tyr Val Leu Phe Cys Val Leu Ala Phe Val 
290 295 300 

Gly He Leu Ala Ala He Cys Ala Leu Tyr Ala Arg He Tyr Cys Gin 
30 305 310 315 320 

Val Arg Ala Asn Ala Arg Arg Leu Pro Ala Arg Pro Gly Thr Ala Gly 
325 330 335 

Thr Thr Ser Thr Arg Ala Arg Arg Lys Pro Arg Ser Leu Ala Leu Leu 
340 345 350 

35 Arg Thr Leu Ser Val Val Leu Leu Ala Phe Val Ala Cys Trp Gly Pro 

355 360 365 

Leu Phe Leu Leu Leu Leu Leu Asp Val Ala Cys Pro Ala Arg Thr Cys 
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375 380 



Pro Val Leu Leu Gin Ala Asp Pro Phe Leu Gly Leu Ala Met Ala Asn 

385 \ 395 

Ser Leu Leu W Pro He He Tyr Thr Leu Thr Asn Arg Asp Leu Arg 

410 415 



10 



405 

His Ala Leu Lei Arg Leu Val Cys Cys Gly Arg His Ser Cys Gly Arg 
420 \ 425 430 

Asp Pro Ser Gly Ser Gin Gin Ser Ala Ser Ala Ala Glu Ala Ser Gly 
435 440 445 

Gly Leu Arg Arg Cys Leu Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly 
450 \ 4 *5 460 



Ser Glu Arg Ser Ser Pro Gin Arg Asp Gly Leu Asp Thr Ser Gly Ser 
465 470 4 75 4 80 

Thr Gly Ser Pro Gly Ala Ptto Thr Ala Ala Arg Thr Leu Val Ser Glu 
15 485 4 *0 4 9 5 

Pro Ala Ala Asp 
500 

(32) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 102 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3l: 

ATGCAAGCCG TCGACAATCT CACCTCTGCG CCTGGGAACA CCAGTCTGTG CACCAGAGAC 60 
TACAAAATCA CCCAGGTCCT CTTCCCACTG CTCTACACTG TCCTGTTTTT TGTTGGACTT 12 0 
ATCACAAATG GCCTGGCGAT GAGGATTTTC TTTCAAATCC GGAGTAAATC AAACTTTATT 180 
ATTTTTCTTA AGAACACAGT CATTTCTGAT CTTCTCATGA TTCTGACTTT TCCATTCAAA 24 0 
30 ATTCTTAGTG ATGCCAAACT GGGAACAGGA CCACTGAGAA CTTTTGTGTG TCAAGTTACC 3 00 
TCCGTCATAT TTTATTTCAC AATGTATATC AGTATTTCAT TC CTGGGACT GATAACTATC 360 
GATCGCTACC AGAAGACCAC C AGGC CATTT AAAACATCCA ACCCCAAAAA TCTCTTGGGG 420 
GCTAAGATTC TCTCTGTTGT CATCTGGGCA TTCATGTTCT TACT CTCTTT GCCTAACATG 4 80 
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ATTCTGACCA ACAGGCAGCC GAGAGACAAG AATGTGAAGA AATGCTCTTT CCTTAAATCA 54 0 
GAGTTCGGTC TAGTCTGGCA TGAAATAGTA AATTACATCT GTCAAGTCAT TTTCTGGATT 600 
AATTTCTTAA TTGTTATTGT ATGTTATACA CTCATTACAA AAGAACTGTA CCGGTCATAC 660 
GTAAGAACGA GGGGTGTAGG TAAAGTCCCC AGGAAAAAGG TGAACGTCAA AGTTTTCATT 720 
5ATCATTGCTG TATTCTTTAT TTGTTTTGTT CCTTTCCATT TTGCCCGAAT TCCTTACACC 7 80 
CTGAGCCAAA CCCGGGATdT CTTTGACTGC ACTGCTGAAA ATACTCTGTT CTATGTGAAA 840 
GAGAGCACTC TGTGGTTAAC TTCCTTAAAT GCATGCCTGG ATCCGTTCAT CTATTTTTTC 900 
CTTTGCAAGT CCTTCAGAAA TTCCTTGATA AGTATGCTGA AGTGCCCCAA TTCTGCAACA 960 
TCTCTGTCCC AGGACAATAG GAAAAAAGAA CAGGATGGTG GTGACCCAAA TGAAGAGACT 1020 

1029 

1 0 CCAATGTAA 

(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



20 



25 



Met Gin Ala Val Asp Asn Leu Thr Ser Ala Pro Gly Asn Thr Ser Leu 
1 5 io 15 

Cys Thr Arg Asp Tyr Lys He Thr Gin Val Leu Phe Pro Leu Leu Tyr 
20 25 30 

Thr Val Leu Phe Phe Val Gly Leu He Thr Asn Gly Leu Ala Met Arg 
35 40 4 5 

He Phe Phe Gin He Arg Ser Lys Ser ASn Phe He He Phe Leu Lys 

50 55 60 

Asn Thr Val He Ser Asp Leu Leu Met He Leu Thr Phe Pro Phe Lys 
65 70 75 80 

He Leu Ser Asp Ala Lys Leu Gly Thr Gly Pro Leu Arg Thr Phe Val 
30 85 90 95 

Cys Gin Val Thr Ser Val He Phe Tyr Phe Thr Met Tyr He Ser He 
100 105 110 

Ser Phe Leu Gly Leu He Thr He Asp Arg Tyr Gin Lys Thr Thr Arg 
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Pro Phe Lys Thr Ser Asn Pro Lys Asn Leu Leu Gly Ala Lys He Leu 

130 "0 
ser Val Val He Trp Ala Phe Met Phe Leu Leu Ser Leu Pro Asn Met 



10 



145 I" 155 

t,*»u Thr Asn 

16S 170 



He Leu Thr Asn Arg Gin Pro Arg Asp Lys Asn Val Lys Lys Cys Ser 



Phe Leu Lys Ser Glu Phe Gly Leu Val Trp His Glu He Val Asn Tyr 
180 185 

lie Cys Gin Val He Phe Trp He Asn Phe Leu He Val He Val Cys 
195 200 205 

Tvr Thr Leu lie Thr Lye Glu Leu Tyr Arg Ser Tyr Val Arg Thr Arg 
^ 210 215 220 

Gly Val Gly Lys Val Pro Arg Lys Lys Val Asn Val Lys Val Phe He 
15 225 230 235 

He He Ala Val Phe Phe He Cys Phe Val Pro Phe His Phe Ala Arg 



245 



250 255 



He Pro Tyr Thr Leu Ser Gin Thr Arg Asp Val Phe Asp Cys Thr Ala 
260 265 270 

20 Glu Asn Thr Leu Phe Tyr Val Lys Glu Ser Thr Leu Trp Leu Thr Ser 

275 280 285 

Leu Asn Ala Cys Leu Asp Pro Phe He Tyr Phe Phe Leu Cys Lys Ser 
290 295 300 

Phe Arg Asn Ser Leu He Ser Met Leu Lys Cys Pro Asn Ser Ala Thr 

-» -i « 315 320 

25 305 310 Ji;> 

Ser Leu Ser Gin Asp Asn Arg Lys Lys Glu Gin Asp Gly Gly Asp Pro 
325 

Asn Glu Glu Thr Pro Met 
340 

30 (34) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1077 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 



330 335 



(ii) MOLECULE TYPE : DNA (genomic) 
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\xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
ATGTCGGTCT GCTACCGTCC CCCAGGGAAC GAGACACTGC TGAGCTGGAA GACTTCGCGG 60 
GCCACAGGCA CAGCCTTCCT GCTGCTGGCG GCGCTGCTGG GGCTGCCTGG CAACGGCTTC 12 0 
GTGGTGTGGA GCTTGGCGGG CTGGCGGCCT GCACGGGGGC GACCGCTGGC GGCCACGCTT 180 
5 GTGCTGCACC TGGCGCTGGC CGACGGCGCG GTGCTGCTGC TCACGCCGCT CTTTGTGGCC 240 
TTCCTGACCC GGCAGGvCCTG GCCGCTGGGC CAGGCGGGCT GCAAGGCGGT GTACTACGTG 3 00 
TGCGCGCTCA GCATGTACGC CAGCGTGCTG CTCACCGGCC TGCTCAGCCT GCAGCGCTGC 360 
CTCGCAGTCA CCCGCCCCT* CCTGGCGCCT CGGCTGCGCA GCCCGGCCCT GGCCCGCCGC 420 
CTGCTGCTGG CGGTCTGGCT GGCCGCCCTG TTGCTCGCCG TCCCGGCCGC CGTCTACCGC 480 
10 CACCTGTGGA GGGACCGCGT ATGCCAGCTG TGCCACCCGT CGCCGGTCCA CGCCGCCGCC 54 0 
CACCTGAGCC TGGAGACTCT GACCGCTTTC GTGCTTCCTT TCGGGCTGAT GCTCGGCTGC 60a 
TACAGCGTGA CGCTGGCACG GCTGCGGGGC GCCCGCTGGG GCTCCGGGCG GCACGGGGCG 66a 
CGGGTGGGCC GGCTGGTGAG CGCCATCGTG CTTGCCTTCG GCTTGCTCTG GGCCCCCTAC 72 0 
CACGCAGTCA ACCTTCTGCA GGCGGTCGCA GCGCTGGCTC CACCGGAAGG GGCCTTGGCG 780 
1 5 AAGCTGGGCG GAGC CGGCCA GGCGGCGCGA GCGGGAACTA CGGCCTTGGC CTTCTTCAGT 840 
TCTAGCGTCA AC CCGGTGCT CTACGTCTTC AC CGCTGGAG ATCTGCTGCC CCGGGCAGGT 90 0 
CCCCGTTTCC TCACGCGGCT CTTCGAAGGC TCTGGGGAGG CCCGAGGGGG CGGCCGCTCT 960 
AGGGAAGGGA CCATGGAGCT CCGAACTACC CCTCAGCTGA AAGTGGTGGG GCAGGGCCGC 102 0 
GGCAATGGAG ACCCGGGGGG TGGGATGGAG AAGGACGGTC CGGAATGGGA CCTTTGA 1077 
20(35) INFORMATION POR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 58 amino acids 

(B) TYPB: amino acid 

( C ) STRAND EDNESS : 

25 (D) TOPOLOGY : not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Ser Val Cys Tyr Arg Pro Pro Gly Asn Glu Thr Leu Leu Ser Trp 
15 10 15 

30 Lys Thr Ser Arg Ala Thr Gly Thr Ala Phe Leu Leu Leu Ala Ala Leu 
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10 



20 



25 30 



Gly Leu Pro Gly Asn Gly Phe Val Val Trp Ser Leu Ala Gly Trp 
35 40 45 



Arg Pro Ala Arg Gly Arg Pro Leu 
50 



Ala Ala Thr Leu Val Leu His Leu 



55 6° 



Ala Leu Ala Asp Gly Ala Val Leu Leu Leu Thr Pro Leu Phe Val Ala 



65 



70 75 80 



Phe Leu Thr Arg Gin Ala Trp Pro Leu Gly Gin Ala Gly Cys Lys Ala 
85 90 95 

Val Tyr Tyr Val Cys Ala Leu Ser Met Tyr Ala Ser Val Leu Leu Thr 
100 I© 5 110 

Gly Leu Leu Ser Leu Gin Arg Cys Leu Ala Val Thr Arg Pro Phe Leu 
11S 120 125 



Ala Pro Arg Leu Arg Ser Pro 



Ala Leu Ala Arg Arg Leu Leu Leu Ala 



15 130 135 140 

Val Trp Leu Ala Ala Leu Leu Leu Ala Val Pro Ala Ala Val Tyr Arg 
145 150 155 160 

His Leu Trp Arg Asp Arg Val Cys Gin Leu Cys His Pro Ser Pro Val 
165 170 I 75 



His Ala Ala Ala His Leu Ser Leu Glu Thr Leu Thr Ala Phe Val Leu 
180 185 190 

Pro Phe Gly Leu Met Leu Gly Cys Tyr Ser Val Thr Leu Ala Arg Leu 
195 200 205 

Arg Gly Ala Arg Trp Gly Ser Gly Arg His Gly Ala Arg Val Gly Arg 
25 210 215 220 

Leu Val Ser Ala lie Val Leu Ala Phe Gly Leu Leu Trp Ala Pro Tyr 
225 230 235 240 

His Ala Val Ann Leu Leu Gin Ala Val Ala Ala Leu Ala Pro Pro Glu 
245 250 255 

30 Gly Ala Leu Ala Lys Leu Gly Gly Ala Gly Gin Ala Ala Arg Ala Gly 

260 265 270 

Thr Thr Ala Leu Ala Phe Phe Ser Ser Ser Val Asn Pro Val Leu Tyr 
275 280 285 

Val Phe Thr Ala Gly Asp Leu Leu Pro Arg Ala Gly Pro Arg Phe Leu 
35 290 295 300 

Thr Arg Leu Phe Glu Gly Ser Gly Glu Ala Arg Gly Gly Gly Arg Ser 
305 310 315 320 
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Arg Glu Gly Thr Met Glu Leu Arg Thr Thr Pro Gin Leu Lys Val Val 
325 330 335 

Gly Gin Gly Arg Gly Asn Gly Asp Pro Gly Gly Gly Met Glu Lys Asp 
340 345 350 

5 Gly Pro Glu Trp Asp Leu 

355 

(3 6) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 
IQ (b) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
15ATGCTGGGGA TCATGGCATG GAATGCAACT TGCAAAAACT GGCTGGCAGC AGAGGCTGCC 60 
CTGGAAAAGT ACTACCTTTC CATTTTTTAT GGGATTGAGT TCGTTGTGGG AGTC CTTGGA 12 0 
AAT AC CATTG TTGTTTACGG CTACATCTTC TCTCTGAAGA ACTGGAACAG CAGTAATATT 180 
TATCTCTTTA AC CTCTCTGT CTCTGACTTA GCTTTTCTGT GCACCCTCCC CATGCTGATA 24 0 
AGGAGTTATG CCAATGGAAA CTGGATATAT GGAGACGTGC TCTGCATAAG CAACCGATAT 300 
20 GTGCTTCATG CCAACCTCTA TACCAGCATT CTCTTTCTCA CTTTT AT CAG CATAGATCGA 360 
TACTTGATAA TTAAGTATCC TTTCCGAGAA CACCTTCTGC AAAAGAAAGA GTTTGCTATT 42 0 
TTAATCTCCT TGGCCATTTG GGTTTTAGTA ACCTTAGAGT TACTACCCAT ACTTCCCCTT 4 80 
ATAAATCCTG TTATAACTGA CAATGGCAC C ACCTGTAATG ATTTTGCAAG TTCTGGAGAC 54 0 
CCCAACTACA ACCTCATTTA CAGCATGTGT CTAACACTGT TGGGGTTCCT TATTCCTCTT 6 00 
25 TTTGTGATGT GTTTCTTTTA TTACAAGATT GCTCTCTTCC TAAAGCAGAG GAATAGGCAG 660 
GTTGCTACTG CTCTGCCCCT TGAAAAGCCT CTCAACTTGG TCATCATGGC AGTGGTAATC 72 0 
TTCTCTGTGC TTTTTACACC CTATCACGTC ATGCGGAATG TGAGGATCGC TTCACGCCTG 780 
GGGAGTTGGA AGCAGTATCA GTGCACTCAG GTCGTCATCA ACT C CTTTT A CATTGTGACA 84 0 
CGGC CTTTGG CCTTTCTGAA CAGTGTCATC AACCCTGTCT TCTATTTTCT TTTGGGAGAT 900 
30 CACTTCAGGG ACATGCTGAT G AAT CAACTG AGACACAACT TCAAATCCCT TACATCCTTT 96 0 
AGCAGATGGG CTCATGAACT C CT ACTTTCA TT CAG AG AAA AGTGA 1005 
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(37) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 34 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Leu Gly He Met Ala Trp Asn Ala Thr Cys Lys Asn Trp Leu Ala 
10 1 5 10 15 

Ala Glu Ala Ala Leu Glu Lys Tyr Tyr Leu Ser He Phe Tyr Gly He 
20 25 30 

Glu Phe Val Val Gly Val Leu Gly Asn Thr He Val Val Tyr Gly Tyr 
35 40 45 



15 



lie Phe Ser Leu Lys Asn Trp Asn Ser Ser Asn He Tyr Leu Phe Asn 
50 55 60 



Leu Ser Val Ser Asp Leu 
65 



Ala Phe Leu Cys Thr Leu Pro Met Leu He 
70 75 80 



atq Ser Tyr Ala Asn Gly Asn Trp He Tyr Gly Asp Val Leu Cys He 
20 85 90 95 

Ser Asn Arg Tyr Val Leu His Ala Asn Leu Tyr Thr Ser He Leu Phe 
100 105 HO 

Leu Thr Phe He Ser He Asp Arg Tyr Leu He He Lys Tyr Pro Phe 
U5 120 I 25 

25 Arg Glu His Leu Leu Gin Lys Lys Glu Phe Ala He Leu He Ser Leu 

130 135 140 

Ala He Trp Val Leu Val Thr Leu Glu Leu Leu Pro He Leu Pro Leu 
145 150 155 160 

He Asn Pro Val He Thr Asp Asn Gly Thr Thr Cys Asn Asp Phe Ala 
30 165 170 175 

Ser Ser Gly Asp Pro Asn Tyr Asn Leu He Tyr Ser Met Cys Leu Thr 
180 185 190 

Leu Leu Gly Phe Leu He Pro Leu Phe Val Met Cys Phe Phe Tyr Tyr 
195 200 205 

35 Lys He Ala Leu Phe Leu Lys Gin Arg Asn Arg Gin Val Ala Thr Ala 

210 215 220 



AP 



AREN-0050 - 76 - PATENT 

Leu Pro Leu Glu Lys Pro Leu Asn Leu Val He Met Ala Val Val He 
225 230 235 240 

Phe Ser Val Leu Phe Thr Pro Tyr His Val Met Arg Asn Val Arg He 
245 250 255 

5 Ala Ser Arg Leu Gly Ser Trp Lys Gin Tyr Gin Cys Thr Gin Val Val 

260 265 270 

He Asn Ser Phe Tyr He Val Thr Arg Pro Leu Ala Phe Leu Asn Ser 
275 280 285 

Val He Asn Pro Val Phe Tyr Phe Leu Leu Gly Asp His Phe Arg Asp 
10 290 295 300 

Met Leu Met Asn Gin Leu Arg His Asn Phe Lys Ser Leu Thr Ser Phe 
305 310 315 320 

Ser Arg Trp Ala His Glu Leu Leu Leu Ser Phe Arg Glu Lys 
325 330 

15 (38) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ATGCAGGCGC TTAACATTAC CCCGGAGCAG TTCTCTCGGC TGCTGCGGGA CCACAACCTG 60 
ACGCGGGAGC AGTTCATCGC TCTGTACCGG CTGCGACCGC TCGTCTACAC CCCAGAGCTG 12 0 

25 CCGGGACGCG CCAAGCTGGC CCTCGTGCTC ACCGGCGTGC TCATCTTCGC CCTGGCGCTC 18 0 
TTTGGCAATa CTCTGGTGTT CTACGTGGTG ACCCGCAGCA AGGCCATGCG CACCGTCACC 24 0 
AACATCTTTA TCTGCTCCTT GGCGCTCAGT GACCTGCTCA TCACCTTCTT CTGCATTCCC 3 00 
GTCACCATGC T C CAG AAC AT TTCCGACAAC TGGCTGGGGG GTGCTTTCAT TTGCAAGATG 360 
GTGCCATTTG TCCAGTCTAC CGCTGTTGTG ACAGAAATGC TCACTATGAC CTGCATTGCT 42 0 

30 GTGGAAAGGC ACCAGGGACT TGTGCATCCT TTTAAAATGA AGTGGCAATA CACCAACCGA 4 80 
AGGGCTTTCA CAATGCTAGG TGTGGTCTGG CTGGTGGCAG TCATCGTAGG ATCACCCATG 54 0 
TGGCACGTGC AACAACTTGA GATCAAATAT GACTTC CT AT ATGAAAAGGA ACAC AT CTGC 60 0 
TGCTTAGAAG AGTGGACCAG CCCTGTGCAC CAGAAGATCT ACACCACCTT CATCCTTGTC 66 0 
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ATCCTCTTCC TCCTGCCTCT tatggtgatg CTTATTCTGT ACAGTAAAAT TGGTTATGAA 720 
CTTTGGATAA AGAAAAGAGT TGGGGATGGT TCAGTGCTTC GAACTATTCA TGGAAAAGAA 780 
ATGTCCAAAA TAGCQAGGAA GAAGAAACGA GCTGTCATTA TGATGGTGAC AGTGGTGGCT 840 
CTCTTTGCTG TGTGcJtGGGC ACCATTCCAT GTTGTCCATA TGATGATTGA ATACAGTAAT 900 
5 TTTGAAAAGG AATAlioATGA TGTCACAATC AAGATGATTT TTGCTATCGT GCAAATTATT 960 
GGATTTTCCA ACTCC^TCTG TAATCCCATT GTCTATGCAT TTATGAATGA AAACTTCAAAl 02 0 
AAAAATGTTT TGTCTC^CAGT TTGTTATTGC ATAGTAAATA AAACCTTCTC TCCAGCACAA1 0 80 
AGGCATGGAA ATTCAdpAAT TACAATGATG CGGAAGAAAG CAAAGTTTTC CCTCAGAGAG 1140 
AATC CAGTGG AGGAAA^CAA AGGAGAAGCA TTCAGTGATG GCAACATTGA AGTCAAATTG12 00 
10TGTGAACAGA CAGAGGAGAA GAAAAAGCTC AAACGACAT C TTGCTCTCTT TAGGTCTGAA1 260 
CTGGCTGAGA ATTCTCCTTT AGACAGTGGG CATTAA 1296 

(3 9) INFORMATION FO^R SEQ ID NO: 38: 

\ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHy 431 amino acids 
15 (b) TYPE: ^mino acid 

(C) STRANDESdNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE i protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Gin Ala Leu Asn Tie Thr Pro Glu Gin Phe Ser Arg Leu Leu Arg 
q 10 15 



20 



Asp Hi 8 Asn Leu Thr Arg Glu Gin Phe He Ala Leu Tyr Arg Leu Arg 
20 25 30 

Pro Leu Val Tyr Thr Pro Glu Leu Pro Gly Arg Ala Lys Leu Ala Leu 
25 35 40 45 

Val Leu Thr Gly Val Leu He Phe Ala Leu Ala Leu Phe Gly Asn Ala 
50 55 60 

Leu Val Phe Tyr Val Val Thr Arg Ser Lys Ala Met Arg Thr Val Thr 
65 70 75 80 

30 Asn He Phe He Cys Ser Leu Ala Leu Ser Asp Leu Leu He Thr Phe 

85 90 95 

Phe Cys He Pro Val Thr Met Leu Gin Asn He Ser Asp Asn Trp Leu 
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Gly Gly Ala Phe He Cys Lys Met Val Pro Phe Val Gin Ser Thr Ala 
115 120 125 

val Val Thr Glu Met Leu Thr Met Thr Cys He Ala Val Glu Arg His 
130 135 140 

Gin Gly Leu Val His Pro Phe Lys Met Lys Trp Gin Tyr Thr Asn Arg 
145 150 155 160 

Arq Ala Phe Thr Met Leu Gly Val Val Trp Leu Val Ala Val He Val 
165 170 I 75 

Glv Ser Pro Met Trp His Val Gin Gin Leu Glu He Lys Tyr Asp Phe 
180 185 190 

Leu Tyr Glu Lys Glu His He Cys Cys Leu Glu Glu Trp Thr Ser Pro 
19 5 200 205 

Val His Gin Lys He Tyr Thr Thr Phe He Leu Val He Leu Phe Leu 
15 210 215 220 

Leu Pro Leu Met Val Met Leu He Leu Tyr Ser Lys He Gly Tyr Glu 
225 230 235 240 

Leu Trp He Lys Lys Arg Val Gly Asp Gly Ser Val Leu Arg Thr He 
245 250 255 

20 His Gly Lys Glu Met Ser Lys He Ala Arg Lys Lys Lys Arg Ala Val 

260 265 270 

He Met Met Val Thr Val Val Ala Leu Phe Ala Val Cys Trp Ala Pro 
275 280 285 

Phe His Val Val His Met Met He Glu Tyr Ser Asn Phe Glu Lys Glu 
25 290 295 300 

Tyr Asp Asp Val Thr He Lys Met He Phe Ala He Val Gin He He 
305 310 315 320 

Gly Phe Ser Asn Ser He Cys Asn Pro He Val Tyr Ala Phe Met Asn 
325 330 335 

30 Glu Asn Phe Lys Lys Asn Val Leu Ser Ala Val Cys Tyr Cys He Val 

340 345 350 

Asn Lys Thr Phe Ser Pro Ala Gin Arg His Gly Asn Ser Gly He Thr 
355 360 365 

Met Met Arg Lys Lys Ala Lys Phe Ser Leu Arg Glu Asn Pro Val Glu 
35 370 375 380 

Glu Thr Lys Gly Glu Ala Phe Ser Asp Gly Asn He Glu Val Lys Leu 
385 390 395 400 
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eye Glu GlA Thr Glu Glu Lys Lys Lys Leu Lys Arg His Leu Ala Leu 



405 410 

Phe Arg Ser Glu Leu Ala Glu Asn Ser Pro Leu Asp Ser Gly His 
4i0 425 430 



5 (4 0) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CTGTGTACAG CAGTTCGCAG AGTG 
(41) INFORMATION FOR SEQ ID NO: 40: 

!5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

G AGTGC CAGG CAGAGCAGGT AGAC 

(42) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 
30 (iv) ANT I - SENSE : NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCGAATTCC TGCTTGCTCC CAGCTTGGC C C 
(43) INFORMATION FOR SEQ ID NO: 42: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI- SENSE* YES 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
TGTGGATCCT GCTGTCAAAG GTCCCATTCC GG 
10 (44) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iv) ANT I - SENSE : NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
TCACAATGCT AGGTGTGGTC 
20 (45) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANT I - SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
TGCATAGACA ATGGGATTAC AG 
30 (4 6) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

i (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

■ TCACAATGCT AGGTGTgWc TGGCTGGTGG CAGTCATCGT AGGATCACCC ATGTGGCACG 60 
: 5TGCAACAACT TGAGATCA^A TATGACTTCC TATATGAAAA GGAACACATC TGCTGCTTAG 12 0 
; AAGAGTGGAC CAGCCCTGTO. CACCAGAAGA TCTACACCAC CTTCATCCTT GTCATCCTCT 180 
! TCCTCCTGCC TCTTATGGTG \aTGCTTATTC TGTACGTAAA ATTGGTTATG AACTTTGGAT 24 0 
AAAGAAAAGA GTTGGGGATG gVtCAGTGCT TCGAACTATT CATGGAAAAG AAATGTCCAA 300 
AATAGCCAGG AAGAAGAAAC GAGCTGTCAT TATGATGGTG ACAGTGGTGG CTCTCTTTGC 360 
1 0 TGTGTGCTGG GCACCATTCC ATGtTGTCCA TATGATGATT GAATACAGTA ATTTTGAAAA 420 
; GGAATATGAT GATGTCACAA TCAAOATGAT TTTTGCTATC GTGCAAATTA TTGGATTTTC 480 
CAACTCCATC TGTAATCCCA TTGTCTfcTGC A 
(47) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 21 base pa^rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CTGCTTAGAA GAGTGGACCA G 

(48) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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30 (iv) ANTI -SENSE: NO 
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|*P (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTGTGCACCA GAAGATCTAC AC 

(4 9) INFORMATION FOR SEQ ID NO: 48: 

( i ) SEQUENCE \CHARACTERISTICS : 
5 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOQY: linear 

(ii) MOLECULE TYPfc: DNA (genomic) 
10 (iv) ANT I - SENSE : YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CAAGGATGAA GGTGGTGTAG A 
(50) INFORMATION FOR SEQ IP NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTGTAGATCT TCTGGTGCAC AGG 

(51) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCAATGCAGG T CAT AG TG AG C 
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<52) INFORMATION FOR SEQ ID NO: 51: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TY&E: nucleic acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TY^E: DNA (genomic) 

( i i i ) HYPOTHET I CAL : \ YES 

(iv) ANTI -SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
10 TGGAGCATGG TGACGGGAAT GCAGAAG 

(53) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
!5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
20 GTGATGAGCA GGTCACTGAG CGCCAAG 

(54) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE : nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 

30 GCAATGCAGG CGCTTAACAT TAC 

(55) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
TTGGGTTACA ATCTGAAGGG CA 
(56) INFORMATION FOR SEQ ID NO: 55: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iv) ANTI -SENSE: NO 



(i) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

ACTCCGTGTC CAGCAGGACT CTG 

(57) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
25 (iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

TGCGTGTTCC TGGACCCTCA CGTG 

(58) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 
CAGGC CTTGG ATTTTAATGT CAGGGATGG 
5 (59) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE : YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: SI 
GGAGAGTCAG CTCTGAAAGA ATTCAGG 
15 (60) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
TGATGTGATG CCAGATACTA ATAGCAC 
25 (61) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
C CTGATT CAT TTAGGTGAGA TTGAGAC 
(62) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS : 

5 (A) XiENGTH : 22 base pairs 

(B) TVPE: nucleic acid 

(C) S*fcRANDEDNESS : single 

(D) TOCOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv) ANTI-SENSE t NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GACAGGTACC TTGCCATCAA G 
(63) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

CTGCACAATG CCAGTGATAA GG 

(64) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



CTGACTTCTT GTTCCTGGCA GCAGCGG 



XREN-0050 -87- PATENT 

Iff' 1 ' ; (65) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE\ nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYP^ : DNA (genomic) 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 
10AGACCAGCCA GGGCACGCTG AAGAGTG 27 

(66) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
20 GATCAAGCTT CCATCCTACT GAAAC CATGG TC 32 

(67) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE : YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
30 GATCAGATCT CAGTTC CAAT ATTCACACCA CCGTC 35 

(68) INFORMATION FOR SEQ ID NO:67: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE : NO 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

CTGGTGTGCT CCATGGCATC CC 
(69) INFORMATION FOR SEQ ID NO: 68: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GTAAGCCTCC C AG AAC GAG A GG 
(70) INFORMATION FOR SEQ ID NO: 69: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
CAGCGCAGGG TGAAGCCTGA GAGC 
(71) INFORMATION FOR SEQ ID NO: 70: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GGCACCTGCT GTGACCTGTG CAGG 
5 (72) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
10 { D ) TOPOLOGY : 1 i ne ar 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71: 
GTCCTGCCAC TTCGAGACAT GG 
15 (73) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE : YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GAAACTTCTC TGCCCTTACC GTC 23 
25 (74) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI - SENSE : NO 
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! : 

^ (x i) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

CCAACACCAG CATCCATGGC ATCAAG 26 

(75) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 7 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iv) ANTI-SENSE: YE9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GGAGAGTCAG CTCTGAAAGA ATTCAG0 27 



